An End-to-End Guide on Reinforcement Learning with Human Feedback