SOLVED: A team is developing a large language model for customer service. They decide to ...

A team is developing a large language model for customer service. They decide to use Reinforcement Learning with Human Feedback to fine-tune the LLM. However they quickly encounter a major roadblock. Which of the following best describes the most likely and significant obstacle they will face in the practical implementation of RLHF?

The inherent simplicity and automation of training the reward model

The ease and abundance of readily available, high-quality human feedback

The substantial cost and logistical challenges associated with gathering sufficient amounts of reliable human feedback

The guaranteed consistency and objectivity of human preferences across different individuals

Verified Answer

Correct Option - c

To get all Infosys Certified Generative AI Professional Advanced Exam questions Join Telegram Group https://grateful.prepflix.in/infy-lex-pdfs