A team is developing a large language model for customer service. They decide to use Reinforcement Learning with Human Feedback to fine-tune the LLM. However they quickly encounter a major roadblock. Which of the following best describes the most likely and significant obstacle they will face in the practical implementation of RLHF?

The inherent simplicity and automation of training the reward model
The ease and abundance of readily available, high-quality human feedback
The substantial cost and logistical challenges associated with gathering sufficient amounts of reliable human feedback
The guaranteed consistency and objectivity of human preferences across different individuals
Verified Answer
Correct Option - c

To get all Infosys Certified Generative AI Professional Advanced Exam questions Join Telegram Group https://rebrand.ly/lex-group-70b557

Telegram

We're passionate about offering best placement materials and courses!! A one stop place for Placement Materials. We daily post Offcampus updates and Placement Materials.

Qtr No. 213, New Town Yehlanka Indore 454775

admin@prepflix.in

Updated on Tue, 16 Sept 2025