Reinforcement Learning in Supply Chain Logistics: Bush Pilots in the Outback