FeynmanWiki
ExploreLibraryWorkspace

Library

Search the public knowledge base.

Filters

Reset all

Field

Machine Learning9Mathematics0Systems0Databases0Physics0Biology0

Topics

Transformers0Attention0LLMs1Reinforcement Learning9Diffusion0LoRA0RAG0Vector Databases0

Sort by

LatestMost readRecently updated

Library

Search the public knowledge base.

K
TransformersReinforcement LearningDiffusionLoRAVector Databases

Curated paths

Transformer Foundations

From attention mechanics to modern architectures.

0 articles
>

Generative Models

Diffusion, flow matching, VAEs, and beyond.

0 articles
>

Reinforcement Learning Path

HIL-SERL: Human-in-the-Loop Sample-Efficient Robotic Reinforcement Learning for Dexterous Manipulation

1 article
>

Math for Deep Learning

Imitation Bootstrapped Reinforcement Learning (IBRL): Using Demonstrations in Exploration and Bootstrapping

1 article
>

9 results

Sort by
LatestMost readRecently updated

HIL-SERL: Human-in-the-Loop Sample-Efficient Robotic Reinforcement Learning for Dexterous Manipulation

A diagram-rich generated explanation from the public library.

Machine LearningReinforcement Learning
Jun 10, 2026 45 min read 0

Imitation Bootstrapped Reinforcement Learning (IBRL)

A diagram-rich generated explanation from the public library.

Machine LearningReinforcement Learning
Jun 2, 2026 45 min read 1

Imitation Bootstrapped Reinforcement Learning (IBRL): Using Demonstrations in Exploration and Bootstrapping

A diagram-rich generated explanation from the public library.

Machine LearningReinforcement Learning
May 26, 2026 45 min read 0

Odysseus: Stable RL Training of VLMs for Long-Horizon Game Decision-Making

A diagram-rich generated explanation from the public library.

Machine LearningReinforcement Learning
May 18, 2026 45 min read 1

ECHO: Turning Terminal Feedback into Dense Supervision for Agent RL

A diagram-rich generated explanation from the public library.

Machine LearningReinforcement Learning
May 18, 2026 45 min read 0

Maximum Likelihood Reinforcement Learning (MaxRL): A Compute-Indexed Bridge from RL to Log-Likelihood

RL framework that approximates maximum likelihood for binary-outcome tasks.

Machine LearningReinforcement Learning
May 17, 2026 45 min read 0

Reinforcement Learning for Large Language Models: Group Relative Policy Optimization (GRPO)

GRPO and RL for LLM's

Machine LearningReinforcement LearningLLMs
May 17, 2026 45 min read 2

LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architectures from Pixels

Stable JEPA-based world model that learns and plans from raw pixels.

Machine LearningReinforcement Learning
May 11, 2026 45 min read 2

Policy Gradient Methods

policy gradient methods

Machine LearningReinforcement Learning
Apr 26, 2026 45 min read 0

Library

Search the public knowledge base.

Filters

Reset all

Field

Machine Learning9Mathematics0Systems0Databases0Physics0Biology0

Topics

Transformers0Attention0LLMs1Reinforcement Learning9Diffusion0LoRA0RAG0Vector Databases0

Sort by

LatestMost readRecently updated

Library

Search the public knowledge base.

K
TransformersReinforcement LearningDiffusionLoRAVector Databases

Curated paths

Transformer Foundations

From attention mechanics to modern architectures.

0 articles
>

Generative Models

Diffusion, flow matching, VAEs, and beyond.

0 articles
>

Reinforcement Learning Path

HIL-SERL: Human-in-the-Loop Sample-Efficient Robotic Reinforcement Learning for Dexterous Manipulation

1 article
>

Math for Deep Learning

Imitation Bootstrapped Reinforcement Learning (IBRL): Using Demonstrations in Exploration and Bootstrapping

1 article
>

9 results

Sort by
LatestMost readRecently updated

HIL-SERL: Human-in-the-Loop Sample-Efficient Robotic Reinforcement Learning for Dexterous Manipulation

A diagram-rich generated explanation from the public library.

Machine LearningReinforcement Learning
Jun 10, 2026 45 min read 0

Imitation Bootstrapped Reinforcement Learning (IBRL)

A diagram-rich generated explanation from the public library.

Machine LearningReinforcement Learning
Jun 2, 2026 45 min read 1

Imitation Bootstrapped Reinforcement Learning (IBRL): Using Demonstrations in Exploration and Bootstrapping

A diagram-rich generated explanation from the public library.

Machine LearningReinforcement Learning
May 26, 2026 45 min read 0

Odysseus: Stable RL Training of VLMs for Long-Horizon Game Decision-Making

A diagram-rich generated explanation from the public library.

Machine LearningReinforcement Learning
May 18, 2026 45 min read 1

ECHO: Turning Terminal Feedback into Dense Supervision for Agent RL

A diagram-rich generated explanation from the public library.

Machine LearningReinforcement Learning
May 18, 2026 45 min read 0

Maximum Likelihood Reinforcement Learning (MaxRL): A Compute-Indexed Bridge from RL to Log-Likelihood

RL framework that approximates maximum likelihood for binary-outcome tasks.

Machine LearningReinforcement Learning
May 17, 2026 45 min read 0

Reinforcement Learning for Large Language Models: Group Relative Policy Optimization (GRPO)

GRPO and RL for LLM's

Machine LearningReinforcement LearningLLMs
May 17, 2026 45 min read 2

LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architectures from Pixels

Stable JEPA-based world model that learns and plans from raw pixels.

Machine LearningReinforcement Learning
May 11, 2026 45 min read 2

Policy Gradient Methods

policy gradient methods

Machine LearningReinforcement Learning
Apr 26, 2026 45 min read 0