Deterministic Problems in Games

What does the transition function T in deterministic sequential planning represent?

In deterministic planning, the environment is dynamic and partially observable.

What is the goal of deterministic sequential planning?

The cumulated reward is calculated using the formula ∑$!r!$ with a discount factor $𝛾$ such that ___ < $𝛾$ ≤ 1.

Match the following components of deterministic planning with their descriptions:

Why are very few deterministic games considered interesting?

In deterministic sequential planning, rewards and costs are always treated equally.

Few deterministic games are interesting to play.
Deterministic planning is used to solve subtasks for AI engines.
Examples include finding building exits or routing to a target.
Approximate and heuristic solutions for non-deterministic problems often rely on transforming the problem into a deterministic one.
This involves assuming the opponent uses the same policy as the player, as in Go or chess.

Assumes static and fully observable environments.
Includes a set of states S = {s1,..,sn}.
Each state has a set of actions A(s).
Has a reward function R: R(s) (if negative - considers a cost function).
Includes a transition function T: S´A => S: t(s,a) = s’.
The goal is to maximize cumulated rewards (minimize cumulated cost)
The formula for cumulated reward/cost of the episode is: ∑$!"# 𝛾 !𝑟!with 0 < 𝛾 ≤ 1
When 𝛾=1, all rewards count equally.
When 𝛾 <1, rewards further in the future are valued less.