Library

Search the public knowledge base.

Transformers Reinforcement Learning Diffusion LoRA Vector Databases

Curated paths

Transformer Foundations

KV Caching in Autoregressive Transformers

1 article>

Generative Models

From Imitation to Refinement – Residual RL for Precise Robotic Assembly

1 article>

Reinforcement Learning Path

From Imitation to Refinement – Residual RL for Precise Robotic Assembly

1 article>

Math for Deep Learning

LoRA Fine-Tuning: Low-Rank Adaptation of Large Neural Networks

1 article>

30 results

Sort by

Latest Most read Recently updated

From Imitation to Refinement – Residual RL for Precise Robotic Assembly

A diagram-rich generated explanation from the public library.

Reinforcement LearningDiffusion

Jun 16, 2026 45 min read 0

HIL-SERL: Human-in-the-Loop Sample-Efficient Robotic Reinforcement Learning for Dexterous Manipulation

A diagram-rich generated explanation from the public library.

Machine LearningReinforcement Learning

Jun 10, 2026 45 min read 0

Imitation Bootstrapped Reinforcement Learning (IBRL)

A diagram-rich generated explanation from the public library.

Machine LearningReinforcement Learning

Jun 2, 2026 45 min read 1

AI Guardrails and Safety: From Failure Modes to Enforcement

AI guardrails and saftey

Deep Dive

May 26, 2026 45 min read 1

Machine-Checkable Termination Guarantees for Bayesian Trust in Multi-Agent Systems

A diagram-rich generated explanation from the public library.

Deep Dive

May 26, 2026 45 min read 1

Harness Engineering: Building Reliable Evaluation and Data Pipelines for ML Systems

Harness engineering

Deep Dive

May 26, 2026 45 min read 0

Dive into Claude Code: Design Space of Today’s and Future AI Agent Systems

A diagram-rich generated explanation from the public library.

Deep Dive

May 26, 2026 45 min read 2

Imitation Bootstrapped Reinforcement Learning (IBRL): Using Demonstrations in Exploration and Bootstrapping

A diagram-rich generated explanation from the public library.

Machine LearningReinforcement Learning

May 26, 2026 45 min read 0

KV Caching in Autoregressive Transformers

KV Caching

Machine LearningTransformersAttention

May 26, 2026 45 min read 0

Speculative Decoding: Lossless Acceleration for Large Language Models

Speculative Decoding in LLM's

LLMs

May 23, 2026 45 min read 0

World Action Models are Zero-shot Policies: The DreamZero Approach

A diagram-rich generated explanation from the public library.

Deep Dive

May 18, 2026 45 min read 3

Odysseus: Stable RL Training of VLMs for Long-Horizon Game Decision-Making

A diagram-rich generated explanation from the public library.

Machine LearningReinforcement Learning

May 18, 2026 45 min read 1

ECHO: Turning Terminal Feedback into Dense Supervision for Agent RL

A diagram-rich generated explanation from the public library.

Machine LearningReinforcement Learning

May 18, 2026 45 min read 0

Mastering the Game of Go with Deep Neural Networks and Tree Search

AlphaGo paper combining deep neural networks with tree search for superhuman Go play

Deep Dive

May 18, 2026 45 min read 2

Vision-Language-Action Models: From Pixels and Instructions to Robot Actions

VLA (Vision Language Action Models)

Machine Learning

May 17, 2026 45 min read 2

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

MoE architecture for efficient LLM scaling via specialized experts

Machine LearningLLMsTransformers

May 17, 2026 45 min read 1

Maximum Likelihood Reinforcement Learning (MaxRL): A Compute-Indexed Bridge from RL to Log-Likelihood

RL framework that approximates maximum likelihood for binary-outcome tasks.

Machine LearningReinforcement Learning

May 17, 2026 45 min read 0

Reinforcement Learning for Large Language Models: Group Relative Policy Optimization (GRPO)

GRPO and RL for LLM's

Machine LearningReinforcement LearningLLMs

May 17, 2026 45 min read 2

Kubernetes Architecture and Networking in Depth

Explain Kubernetes architecture including networking in depth

Systems

May 14, 2026 45 min read 1

LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architectures from Pixels

Stable JEPA-based world model that learns and plans from raw pixels.

Machine LearningReinforcement Learning

May 11, 2026 45 min read 2

Recursive Language Models: Scaling LLM Contexts via Symbolic Recursion

Language models that recursively refine or compose intermediate reasoning/representations.

Machine LearningLLMs

May 11, 2026 45 min read 1

Vector Embeddings and Vector Databases

DatabasesVector Databases

May 10, 2026 45 min read 2

Mamba-3: Improved Sequence Modeling using State Space Principles

Machine Learning

May 9, 2026 45 min read 0

World Models: Learning to Dream for Efficient Reinforcement Learning

World Models

Deep Dive

May 7, 2026 84 min read 1

I-JEPA: Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture

Self-supervised vision model for learning image representations.

Deep Dive

May 1, 2026 87 min read 0

Transformers: Attention, Architecture, Training, and Scaling

Transformers

Machine LearningTransformersAttention

May 1, 2026 84 min read 0

Diffusion Models and Flow Matching: From Score-Based Diffusion to Continuous Normalizing Flows

Diffusion and flow-matching

Machine LearningDiffusion

Apr 30, 2026 87 min read 3

LoRA Fine-Tuning: Low-Rank Adaptation of Large Neural Networks

LORA Fine-tuning (Low rank adaption)

Machine LearningLoRA

Apr 29, 2026 78 min read 0

Variational Autoencoders: Principles, Derivations, and Applications

Variational auto-encoders

Deep Dive

Apr 26, 2026 84 min read 1

Policy Gradient Methods

policy gradient methods

Machine LearningReinforcement Learning

Apr 26, 2026 45 min read 0