• reinforcement learning

  • supervised learning

  • unsupervised learning

  • expectimax

  • reward function

  • MCP