Which of the following is a challenge often associated with Reinforcement Learning from Human Feedback?

Human feedback is always perfect and error-free.

It can require substantial human effort and expertise.

RLHF doesn't work with deep learning models

RLHF doesn't generalize to different tasks

Verified Answer
Correct Option - b

To get all Infosys Certified Generative AI Professional - Expert Exam questions Join Telegram Group https://rebrand.ly/lex-telegram-236dee

Telegram