Sitemap - 2023 - Deep (Learning) Focus

Google Gemini: Fact or Fiction?

Explaining ChatGPT to Anyone in <20 Minutes

Easily Train a Specialized LLM: PEFT, LoRA, QLoRA, LLaMA-Adapter, and More

The Story of RLHF: Origins, Motivations, Techniques, and Modern Applications

Self-Critique, Self-RAG, NEFTune, Safe RLHF and More

Proximal Policy Optimization (PPO): The Key to LLM Alignment

StreamingLLM, QA-LoRA, GPT-4V, LLaVA, Reversal Curse and More

Policy Gradients: The Foundation of RLHF

Basics of Reinforcement Learning for LLMs

RLAIF: Reinforcement Learning from AI Feedback

Understanding and Using Supervised Fine-Tuning (SFT) for Language Models

Language Model Training and Inference: From Concept to Code

Graph-Based Prompting and Reasoning with Language Models

Tree of Thoughts Prompting

LLaMA-2 from the Ground Up

The History of Open-Source LLMs: Imitation and Alignment (Part Three)

The History of Open-Source LLMs: Better Base Models (Part Two)

The History of Open-Source LLMs: Early Days (Part One)

Data is the Foundation of Language Models

Falcon: The Pinnacle of Open-Source LLMs

Democratizing AI: MosaicML's Impact on the Open-Source LLM Movement

Orca: Properly Imitating Proprietary LLMs

Imitation Models and the Open-Source LLM Revolution

Can language models make their own tools?

Language Models and Friends: Gorilla, HuggingGPT, TaskMatrix, and More

Teaching Language Models to use Tools

Program-Aided Language Models

Prompt Ensembles Make LLMs More Reliable

Advanced Prompt Engineering

Practical Prompt Engineering

Chain of Thought Prompting for LLMs

Beyond LLaMA: The Power of Open LLMs

LLaMA: LLMs for Everyone!

T5: Text-to-Text Transformers (Part Two)

T5: Text-to-Text Transformers (Part One)

PaLM: Efficiently Training Massive Language Models

Vision Transformers: From Idea to Applications (Part Six)

Vision Transformers: From Idea to Applications (Part Four)

Vision Transformers: From Idea to Applications (Part Two)

Beyond NeRFs (Part Two)

Beyond NeRFs (Part One)

iMAP: Modeling 3D Scenes in Real-Time

Understanding NeRFs

Local Light Field Fusion

Scene Representation Networks

Shape Reconstruction with ONets

3D Generative Modeling with DeepSDF

Specialized LLMs: ChatGPT, LaMDA, Galactica, Codex, Sparrow, and More