sequenceDiagram participant Game participant Agent Agent->>Game: action Game-->>Agent: reward, state Note over Agent: learning Note over Game: keep going Note over Agent: select action Agent->>Game: next action Game-->>Agent: updated reward, state