Sitemap - 2023 - Deep (Learning) Focus
Google Gemini: Fact or Fiction?
Explaining ChatGPT to Anyone in <20 Minutes
Easily Train a Specialized LLM: PEFT, LoRA, QLoRA, LLaMA-Adapter, and More
The Story of RLHF: Origins, Motivations, Techniques, and Modern Applications
Self-Critique, Self-RAG, NEFTune, Safe RLHF and More
Proximal Policy Optimization (PPO): The Key to LLM Alignment
StreamingLLM, QA-LoRA, GPT-4V, LLaVA, Reversal Curse and More
Policy Gradients: The Foundation of RLHF
Basics of Reinforcement Learning for LLMs
RLAIF: Reinforcement Learning from AI Feedback
Understanding and Using Supervised Fine-Tuning (SFT) for Language Models
Language Model Training and Inference: From Concept to Code
Graph-Based Prompting and Reasoning with Language Models
The History of Open-Source LLMs: Imitation and Alignment (Part Three)
The History of Open-Source LLMs: Better Base Models (Part Two)
The History of Open-Source LLMs: Early Days (Part One)
Data is the Foundation of Language Models
Falcon: The Pinnacle of Open-Source LLMs
Democratizing AI: MosaicML's Impact on the Open-Source LLM Movement
Orca: Properly Imitating Proprietary LLMs
Imitation Models and the Open-Source LLM Revolution
Can language models make their own tools?
Language Models and Friends: Gorilla, HuggingGPT, TaskMatrix, and More
Teaching Language Models to use Tools
Prompt Ensembles Make LLMs More Reliable
Chain of Thought Prompting for LLMs
Beyond LLaMA: The Power of Open LLMs
T5: Text-to-Text Transformers (Part Two)
T5: Text-to-Text Transformers (Part One)
PaLM: Efficiently Training Massive Language Models
Vision Transformers: From Idea to Applications (Part Six)
Vision Transformers: From Idea to Applications (Part Four)
Vision Transformers: From Idea to Applications (Part Two)
iMAP: Modeling 3D Scenes in Real-Time
Shape Reconstruction with ONets
3D Generative Modeling with DeepSDF
Specialized LLMs: ChatGPT, LaMDA, Galactica, Codex, Sparrow, and More