Reinforcement Learning in Supply Chain Logistics: Bush Pilots at Scale