FeynmanWiki
ExploreLibraryWorkspace

Library

Search the public knowledge base.

Filters

Reset all

Field

Machine Learning17Mathematics0Systems0Databases0Physics0Biology0

Topics

Transformers3Attention2LLMs3Reinforcement Learning9Diffusion1LoRA1RAG0Vector Databases0

Sort by

LatestMost readRecently updated

Library

Search the public knowledge base.

K
TransformersReinforcement LearningDiffusionLoRAVector Databases

Curated paths

Transformer Foundations

KV Caching in Autoregressive Transformers

1 article
>

Generative Models

Diffusion Models and Flow Matching: From Score-Based Diffusion to Continuous Normalizing Flows

1 article
>

Reinforcement Learning Path

HIL-SERL: Human-in-the-Loop Sample-Efficient Robotic Reinforcement Learning for Dexterous Manipulation

1 article
>

Math for Deep Learning

LoRA Fine-Tuning: Low-Rank Adaptation of Large Neural Networks

1 article
>

17 results

Sort by
LatestMost readRecently updated

HIL-SERL: Human-in-the-Loop Sample-Efficient Robotic Reinforcement Learning for Dexterous Manipulation

A diagram-rich generated explanation from the public library.

Machine LearningReinforcement Learning
Jun 10, 2026 45 min read 0

Imitation Bootstrapped Reinforcement Learning (IBRL)

A diagram-rich generated explanation from the public library.

Machine LearningReinforcement Learning
Jun 2, 2026 45 min read 1

Imitation Bootstrapped Reinforcement Learning (IBRL): Using Demonstrations in Exploration and Bootstrapping

A diagram-rich generated explanation from the public library.

Machine LearningReinforcement Learning
May 26, 2026 45 min read 0

KV Caching in Autoregressive Transformers

KV Caching

Machine LearningTransformersAttention
May 26, 2026 45 min read 0

Odysseus: Stable RL Training of VLMs for Long-Horizon Game Decision-Making

A diagram-rich generated explanation from the public library.

Machine LearningReinforcement Learning
May 18, 2026 45 min read 1

ECHO: Turning Terminal Feedback into Dense Supervision for Agent RL

A diagram-rich generated explanation from the public library.

Machine LearningReinforcement Learning
May 18, 2026 45 min read 0

Vision-Language-Action Models: From Pixels and Instructions to Robot Actions

VLA (Vision Language Action Models)

Machine Learning
May 17, 2026 45 min read 2

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

MoE architecture for efficient LLM scaling via specialized experts

Machine LearningLLMsTransformers
May 17, 2026 45 min read 1

Maximum Likelihood Reinforcement Learning (MaxRL): A Compute-Indexed Bridge from RL to Log-Likelihood

RL framework that approximates maximum likelihood for binary-outcome tasks.

Machine LearningReinforcement Learning
May 17, 2026 45 min read 0

Reinforcement Learning for Large Language Models: Group Relative Policy Optimization (GRPO)

GRPO and RL for LLM's

Machine LearningReinforcement LearningLLMs
May 17, 2026 45 min read 2

LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architectures from Pixels

Stable JEPA-based world model that learns and plans from raw pixels.

Machine LearningReinforcement Learning
May 11, 2026 45 min read 2

Recursive Language Models: Scaling LLM Contexts via Symbolic Recursion

Language models that recursively refine or compose intermediate reasoning/representations.

Machine LearningLLMs
May 11, 2026 45 min read 1

Mamba-3: Improved Sequence Modeling using State Space Principles

Mamba-3: Improved Sequence Modeling using State Space Principles

Machine Learning
May 9, 2026 45 min read 0

Transformers: Attention, Architecture, Training, and Scaling

Transformers

Machine LearningTransformersAttention
May 1, 2026 84 min read 0

Diffusion Models and Flow Matching: From Score-Based Diffusion to Continuous Normalizing Flows

Diffusion and flow-matching

Machine LearningDiffusion
Apr 30, 2026 87 min read 3

LoRA Fine-Tuning: Low-Rank Adaptation of Large Neural Networks

LORA Fine-tuning (Low rank adaption)

Machine LearningLoRA
Apr 29, 2026 78 min read 0

Policy Gradient Methods

policy gradient methods

Machine LearningReinforcement Learning
Apr 26, 2026 45 min read 0

Library

Search the public knowledge base.

Filters

Reset all

Field

Machine Learning17Mathematics0Systems0Databases0Physics0Biology0

Topics

Transformers3Attention2LLMs3Reinforcement Learning9Diffusion1LoRA1RAG0Vector Databases0

Sort by

LatestMost readRecently updated

Library

Search the public knowledge base.

K
TransformersReinforcement LearningDiffusionLoRAVector Databases

Curated paths

Transformer Foundations

KV Caching in Autoregressive Transformers

1 article
>

Generative Models

Diffusion Models and Flow Matching: From Score-Based Diffusion to Continuous Normalizing Flows

1 article
>

Reinforcement Learning Path

HIL-SERL: Human-in-the-Loop Sample-Efficient Robotic Reinforcement Learning for Dexterous Manipulation

1 article
>

Math for Deep Learning

LoRA Fine-Tuning: Low-Rank Adaptation of Large Neural Networks

1 article
>

17 results

Sort by
LatestMost readRecently updated

HIL-SERL: Human-in-the-Loop Sample-Efficient Robotic Reinforcement Learning for Dexterous Manipulation

A diagram-rich generated explanation from the public library.

Machine LearningReinforcement Learning
Jun 10, 2026 45 min read 0

Imitation Bootstrapped Reinforcement Learning (IBRL)

A diagram-rich generated explanation from the public library.

Machine LearningReinforcement Learning
Jun 2, 2026 45 min read 1

Imitation Bootstrapped Reinforcement Learning (IBRL): Using Demonstrations in Exploration and Bootstrapping

A diagram-rich generated explanation from the public library.

Machine LearningReinforcement Learning
May 26, 2026 45 min read 0

KV Caching in Autoregressive Transformers

KV Caching

Machine LearningTransformersAttention
May 26, 2026 45 min read 0

Odysseus: Stable RL Training of VLMs for Long-Horizon Game Decision-Making

A diagram-rich generated explanation from the public library.

Machine LearningReinforcement Learning
May 18, 2026 45 min read 1

ECHO: Turning Terminal Feedback into Dense Supervision for Agent RL

A diagram-rich generated explanation from the public library.

Machine LearningReinforcement Learning
May 18, 2026 45 min read 0

Vision-Language-Action Models: From Pixels and Instructions to Robot Actions

VLA (Vision Language Action Models)

Machine Learning
May 17, 2026 45 min read 2

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

MoE architecture for efficient LLM scaling via specialized experts

Machine LearningLLMsTransformers
May 17, 2026 45 min read 1

Maximum Likelihood Reinforcement Learning (MaxRL): A Compute-Indexed Bridge from RL to Log-Likelihood

RL framework that approximates maximum likelihood for binary-outcome tasks.

Machine LearningReinforcement Learning
May 17, 2026 45 min read 0

Reinforcement Learning for Large Language Models: Group Relative Policy Optimization (GRPO)

GRPO and RL for LLM's

Machine LearningReinforcement LearningLLMs
May 17, 2026 45 min read 2

LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architectures from Pixels

Stable JEPA-based world model that learns and plans from raw pixels.

Machine LearningReinforcement Learning
May 11, 2026 45 min read 2

Recursive Language Models: Scaling LLM Contexts via Symbolic Recursion

Language models that recursively refine or compose intermediate reasoning/representations.

Machine LearningLLMs
May 11, 2026 45 min read 1

Mamba-3: Improved Sequence Modeling using State Space Principles

Mamba-3: Improved Sequence Modeling using State Space Principles

Machine Learning
May 9, 2026 45 min read 0

Transformers: Attention, Architecture, Training, and Scaling

Transformers

Machine LearningTransformersAttention
May 1, 2026 84 min read 0

Diffusion Models and Flow Matching: From Score-Based Diffusion to Continuous Normalizing Flows

Diffusion and flow-matching

Machine LearningDiffusion
Apr 30, 2026 87 min read 3

LoRA Fine-Tuning: Low-Rank Adaptation of Large Neural Networks

LORA Fine-tuning (Low rank adaption)

Machine LearningLoRA
Apr 29, 2026 78 min read 0

Policy Gradient Methods

policy gradient methods

Machine LearningReinforcement Learning
Apr 26, 2026 45 min read 0