FeynmanWiki

Library

Search the public knowledge base.



Curated paths

Transformer Foundations

KV Caching in Autoregressive Transformers

1 article

Generative Models

Diffusion Models and Flow Matching: From Score-Based Diffusion to Continuous Normalizing Flows

1 article

Reinforcement Learning Path

HIL-SERL: Human-in-the-Loop Sample-Efficient Robotic Reinforcement Learning for Dexterous Manipulation

1 article

Math for Deep Learning

LoRA Fine-Tuning: Low-Rank Adaptation of Large Neural Networks

1 article

29 results


HIL-SERL: Human-in-the-Loop Sample-Efficient Robotic Reinforcement Learning for Dexterous Manipulation

A diagram-rich generated explanation from the public library.

Machine Learning, Reinforcement Learning
Jun 10, 2026 45 min read 0

Imitation Bootstrapped Reinforcement Learning (IBRL)

A diagram-rich generated explanation from the public library.

Machine Learning, Reinforcement Learning
Jun 2, 2026 45 min read 1

AI Guardrails and Safety: From Failure Modes to Enforcement

AI guardrails and safety

Deep Dive
May 26, 2026 45 min read 1

Machine-Checkable Termination Guarantees for Bayesian Trust in Multi-Agent Systems

A diagram-rich generated explanation from the public library.

Deep Dive
May 26, 2026 45 min read 1

Harness Engineering: Building Reliable Evaluation and Data Pipelines for ML Systems

Harness engineering

Deep Dive
May 26, 2026 45 min read 0

Dive into Claude Code: Design Space of Today’s and Future AI Agent Systems

A diagram-rich generated explanation from the public library.

Deep Dive
May 26, 2026 45 min read 1

Imitation Bootstrapped Reinforcement Learning (IBRL): Using Demonstrations in Exploration and Bootstrapping

A diagram-rich generated explanation from the public library.

Machine Learning, Reinforcement Learning
May 26, 2026 45 min read 0

KV Caching in Autoregressive Transformers

KV Caching

Machine Learning, Transformers, Attention
May 26, 2026 45 min read 0

Speculative Decoding: Lossless Acceleration for Large Language Models

Speculative Decoding in LLMs

LLMs
May 23, 2026 45 min read 0

World Action Models are Zero-shot Policies: The DreamZero Approach

A diagram-rich generated explanation from the public library.

Deep Dive
May 18, 2026 45 min read 3

Odysseus: Stable RL Training of VLMs for Long-Horizon Game Decision-Making

A diagram-rich generated explanation from the public library.

Machine Learning, Reinforcement Learning
May 18, 2026 45 min read 1

ECHO: Turning Terminal Feedback into Dense Supervision for Agent RL

A diagram-rich generated explanation from the public library.

Machine Learning, Reinforcement Learning
May 18, 2026 45 min read 0

Mastering the Game of Go with Deep Neural Networks and Tree Search

AlphaGo paper combining deep neural networks with tree search for superhuman Go play

Deep Dive
May 18, 2026 45 min read 2

Vision-Language-Action Models: From Pixels and Instructions to Robot Actions

VLA (Vision Language Action Models)

Machine Learning
May 17, 2026 45 min read 2

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

MoE architecture for efficient LLM scaling via specialized experts

Machine Learning, LLMs, Transformers
May 17, 2026 45 min read 1

Maximum Likelihood Reinforcement Learning (MaxRL): A Compute-Indexed Bridge from RL to Log-Likelihood

RL framework that approximates maximum likelihood for binary-outcome tasks.

Machine Learning, Reinforcement Learning
May 17, 2026 45 min read 0

Reinforcement Learning for Large Language Models: Group Relative Policy Optimization (GRPO)

GRPO and RL for LLMs

Machine Learning, Reinforcement Learning, LLMs
May 17, 2026 45 min read 2

Kubernetes Architecture and Networking in Depth

Explain Kubernetes architecture including networking in depth

Systems
May 14, 2026 45 min read 1

LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architectures from Pixels

Stable JEPA-based world model that learns and plans from raw pixels.

Machine Learning, Reinforcement Learning
May 11, 2026 45 min read 2

Recursive Language Models: Scaling LLM Contexts via Symbolic Recursion

Language models that recursively refine or compose intermediate reasoning/representations.

Machine Learning, LLMs
May 11, 2026 45 min read 1

Vector Embeddings and Vector Databases

Vector Embeddings and Vector Databases

Databases, Vector Databases
May 10, 2026 45 min read 2

Mamba-3: Improved Sequence Modeling using State Space Principles

Mamba-3: Improved Sequence Modeling using State Space Principles

Machine Learning
May 9, 2026 45 min read 0

World Models: Learning to Dream for Efficient Reinforcement Learning

World Models

Deep Dive
May 7, 2026 84 min read 0

I-JEPA: Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture

Self-supervised vision model for learning image representations.

Deep Dive
May 1, 2026 87 min read 0

Transformers: Attention, Architecture, Training, and Scaling

Transformers

Machine Learning, Transformers, Attention
May 1, 2026 84 min read 0

Diffusion Models and Flow Matching: From Score-Based Diffusion to Continuous Normalizing Flows

Diffusion and flow-matching

Machine Learning, Diffusion
Apr 30, 2026 87 min read 3

LoRA Fine-Tuning: Low-Rank Adaptation of Large Neural Networks

LoRA fine-tuning (low-rank adaptation)

Machine Learning, LoRA
Apr 29, 2026 78 min read 0

Variational Autoencoders: Principles, Derivations, and Applications

Variational auto-encoders

Deep Dive
Apr 26, 2026 84 min read 1

Policy Gradient Methods

Policy gradient methods

Machine Learning, Reinforcement Learning
Apr 26, 2026 45 min read 0
