Large Language Models (LLMs)
- — What is a large language model?
- — AI models that deal with language information trained huge billions of parameters
- — How they work?
- — Basically matrix multiplication
- — Important concepts
- — Tokenization
- — Embedding
- — Token to vector where the vectors have meaning
- — Transformers
- — Use words in the context, to enrich each token with more meaning
- — Done in many iterations
- — Attention mechanism
- — Deciding which words in the context to attend to
- — Types of LLMs
- — Translators
- — String of tokens as input and string of tokens as output
- — Generators
- — Given a starting piece, predict the next token.
- — Done recursively to generate long text
- — Famous models