Explained: What is ChatGPT, how it works and the limitations it has – Times of India
ChatGPT: What is it?
ChatGPT is built on GPT-3.5, an OpenAI language model that uses machine learning to generate text in response to a user's questions. For instance, if you ask “what’s the meaning of life”, ChatGPT will give you a rather detailed answer. Many examples shared by users on Twitter show the power of AI in generating text. AI-based chatbots aren’t really a new thing, but ChatGPT does a more detailed job than most people are used to.
How does ChatGPT work?
OpenAI, in a blog post, explained how it made ChatGPT work. “We trained this model using Reinforcement Learning from Human Feedback (RLHF), using the same methods as InstructGPT, but with slight differences in the data collection setup.” OpenAI said that it trained an initial model using supervised fine-tuning: “human AI trainers provided conversations in which they played both sides—the user and an AI assistant.” Further, it gave the trainers access to model-written suggestions to help them compose their responses.
To build a reward model for reinforcement learning, OpenAI collected comparison data, which consisted of two or more model responses ranked by quality. “To collect this data, we took conversations that AI trainers had with the chatbot. We randomly selected a model-written message, sampled several alternative completions, and had AI trainers rank them,” explained the company.
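To make the ranking step more concrete, here is a minimal sketch in Python of how ranked responses could be turned into a training signal for a reward model. The article does not say which loss function OpenAI used. The pairwise logistic loss below is an assumption borrowed from common practice in preference learning, and the function names and toy scores are hypothetical, chosen only for illustration.

```python
import math
from itertools import combinations


def ranking_to_pairs(ranked_responses):
    """Turn a best-to-worst ranking into (preferred, rejected) pairs."""
    # combinations() keeps input order, so the first item of each pair
    # is always the one the trainer ranked higher.
    return list(combinations(ranked_responses, 2))


def pairwise_loss(score_preferred, score_rejected):
    """Pairwise logistic loss (assumed here, not confirmed by the article).

    Equals -log(sigmoid(score_preferred - score_rejected)); it shrinks as
    the reward model scores the preferred response further above the
    rejected one.
    """
    return math.log1p(math.exp(-(score_preferred - score_rejected)))


def mean_ranking_loss(ranked_responses, reward_fn):
    """Average the pairwise loss over every pair implied by one ranking."""
    pairs = ranking_to_pairs(ranked_responses)
    losses = [pairwise_loss(reward_fn(a), reward_fn(b)) for a, b in pairs]
    return sum(losses) / len(losses)


# Hypothetical example: three completions ranked best to worst by a trainer,
# scored by a stand-in "reward model" (a plain lookup table).
ranking = ["clear answer", "vague answer", "off-topic answer"]
toy_scores = {"clear answer": 2.0, "vague answer": 0.5, "off-topic answer": -1.0}

loss_when_agreeing = mean_ranking_loss(ranking, toy_scores.get)
loss_when_disagreeing = mean_ranking_loss(list(reversed(ranking)), toy_scores.get)
print(loss_when_agreeing < loss_when_disagreeing)
```

In this sketch, a reward model whose scores agree with the trainer's ranking gets a lower loss than one whose scores contradict it, which is the signal a training procedure would use to adjust the model.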
What can be potential problems with ChatGPT?
OpenAI says it is aware of the model's limitations. For instance, although OpenAI has worked on moderating replies, the chatbot will sometimes respond to harmful instructions or exhibit biased behaviour. “We’re using moderation API to warn or block certain types of unsafe content, but we expect it to have some false negatives and positives for now.”
Also, there are times when the chatbot goes into far too much detail. “The model is often excessively verbose and overuses certain phrases, such as restating that it’s a language model trained by OpenAI. These issues arise from biases in the training data (trainers prefer longer answers that look more comprehensive) and well-known over-optimisation issues,” explained OpenAI in a blog post.