VOCABULARY

ChatGPT

ChatGPT is an AI language model developed by OpenAI. It belongs to the GPT (Generative Pre-trained Transformer) series of models, which use machine learning to carry on human-like text conversations. It can generate creative content, answer questions, translate between languages, and imitate the writing style of particular authors.

Because it can interpret text inputs and return detailed, relevant responses, it is useful in applications such as customer service, content creation, and programming help.

How ChatGPT works

ChatGPT is built on a variant of a machine learning architecture called the transformer, which is pretrained on a large corpus of text from the internet. During pretraining the model repeatedly predicts the next word in a sentence. Through this task it learns grammar, facts about the world, and some reasoning ability, but it can also absorb biases present in its training data.
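The idea of learning from next-word prediction can be illustrated with a deliberately tiny sketch. This is not how ChatGPT is implemented: a transformer conditions on long contexts with learned parameters, whereas the toy below only counts which word follows which (a bigram model). The function names `train_bigram` and `predict_next` and the three-sentence corpus are assumptions made up for this illustration.

```python
from collections import Counter, defaultdict

def train_bigram(corpus):
    """Count, for every word, which words follow it in the corpus."""
    counts = defaultdict(Counter)
    for sentence in corpus:
        words = sentence.lower().split()
        # Pair each word with the word that comes right after it.
        for current, following in zip(words, words[1:]):
            counts[current][following] += 1
    return counts

def predict_next(counts, word):
    """Return the most frequent follower of `word`, or None if unseen."""
    followers = counts.get(word.lower())
    if not followers:
        return None
    return followers.most_common(1)[0][0]

# Hypothetical toy corpus, standing in for "a large corpus of text".
corpus = [
    "the cat sat on the mat",
    "the cat chased the mouse",
    "the dog sat on the rug",
]
model = train_bigram(corpus)
print(predict_next(model, "sat"))  # prints "on"
print(predict_next(model, "the"))  # prints "cat"
```

Even this toy picks up patterns (and any skew) in its corpus purely from prediction statistics, which is the same reason a far larger model can pick up both useful knowledge and unwanted biases.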

For conversational use, ChatGPT is trained with an approach called Reinforcement Learning from Human Feedback (RLHF). The model is first fine-tuned on dialogues written by human AI trainers, who play both the user and the AI assistant. Trainers also have access to model-written suggestions to help them compose responses. This new dialogue dataset is then mixed with the InstructGPT dataset, which is itself transformed into a dialogue format.

The model is further optimized using reinforcement learning on comparison data: pairs of alternative completions ranked by quality. This process is repeated over several iterations to improve ChatGPT's capabilities. Although it can generate insightful and diverse outputs, ChatGPT has known limitations, including incorrect or nonsensical answers, sensitivity to input phrasing, verbosity, and overuse of certain phrases. OpenAI continues to work on model improvements and relies on user feedback to make the model safer and more useful.
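As a rough illustration of how comparison data can be turned into a training signal, the sketch below expands a ranked list of completions into (preferred, rejected) pairs and scores each pair with a pairwise logistic ranking loss. This loss is a common choice in the RLHF literature and is an assumption here; the text above does not state which objective OpenAI uses. The function names and the reward values are hypothetical.

```python
import math
from itertools import combinations

def ranking_to_pairs(ranked_responses):
    """Expand a best-first ranking into (preferred, rejected) pairs."""
    # combinations() keeps input order, so the earlier (better) item comes first.
    return list(combinations(ranked_responses, 2))

def pairwise_ranking_loss(reward_preferred, reward_rejected):
    """Compute -log(sigmoid(r_preferred - r_rejected)) in a numerically stable way."""
    margin = reward_preferred - reward_rejected
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))

# Hypothetical reward scores that some reward model assigned to three completions.
rewards = {"answer_a": 1.8, "answer_b": 0.4, "answer_c": -0.9}
# Hypothetical human ranking, best first.
ranking = ["answer_a", "answer_b", "answer_c"]

pairs = ranking_to_pairs(ranking)
losses = [pairwise_ranking_loss(rewards[p], rewards[r]) for p, r in pairs]
mean_loss = sum(losses) / len(losses)
print(pairs)
print(round(mean_loss, 4))
```

The loss shrinks when the reward model scores the human-preferred completion higher than the rejected one, and grows when the order is reversed. A reward model trained this way can then supply the reward signal for the reinforcement learning step.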
