Temporal Difference Reinforcement Learning for Robots-TxvX