Expected return of the state or action
State-value function:
$v_\pi(s)=E_\pi[G_t|S_t=s]=E_\pi[R_{t+1}+\gamma G_{t+1}|S_t=s]$
Action-value function:
$q_\pi(s,a)=E_\pi[G_t|S_t=s,A_t=a]=E_\pi[R_{t+1}+\gamma G_{t+1}|S_t=s,A_t=a]$
Comparison