Foraging decisions as multi-armed bandit problems: Applying reinforcement learning algorithms to foraging data