How Does ChatGPT Really Work?

The New York Times
April 11, 2023
Photo by Mojahid Mottakin on Unsplash

Chatbot artificial intelligence (AI), built on a large language model (LLM), can write fluent prose, draft emails and more. Understanding how these systems work means understanding the steps used to build them: defining an objective function, collecting data, constructing a neural network, then training and tuning it.

The artificial intelligence that powers ChatGPT, Microsoft’s Bing chatbot and Google’s Bard can carry out humanlike conversations and write natural, fluid prose on an endless variety of topics. It can also perform complex tasks, from writing code to planning a kid’s birthday party.

But how does it all work? To answer that, we need to peek under the hood of something called a large language model — the type of A.I. that drives these systems.

Large language models, or L.L.M.s, are relatively new on the A.I. scene. The first ones appeared only about five years ago, and they weren’t very good. But today they can draft emails, presentations and memos and tutor you in a foreign language. Even more capabilities are sure to surface in the coming months and years, as the technology improves and Silicon Valley scrambles to cash in.

