DEEPSEEK

GRPOMarch 4, 2025

GRPO (Group Relative Policy Optimization) Study Notes

We introduce Group Relative Policy Optimization (GRPO), a variant of Proximal Policy Optimization (PPO)

DEEPSEEKFebruary 28, 2025

DeepSeek #OpenSourceWeek - Five Consecutive Releases

We're a tiny team @deepseek_ai exploring AGI.

ANDREJFebruary 24, 2025

DeepSeek-R1 - Andrej Karpathy in-depth explanation of LLM (Part 9)

DeepSeek R1

LLMFebruary 19, 2025

How Different Large Models Like DeepSeek R1/ChatGPT o3/Grok3 Think

LLM Think

DEEPFebruary 11, 2025

Andrej Karpathy in-depth explanation of large language model (LLM) technology (Part 1) - [Pretraining and Inference]

- introduction - pretraining data (internet) - tokenization - neural network I/O - neural network internals - inference

DEEPSEEKJanuary 28, 2025

DeepSeek Janus Series: Unified Multimodal Understanding and Generation Models

Janus-Series: Unified Multimodal Understanding and Generation Models

DEEPSEEKJanuary 27, 2025

Comparison of the reasoning processes between ChatGPT o1 pro and DeepSeek R1

DeepSeek R1 Vs ChatGPT 01 (My Experience)

DEEPSEEKJanuary 26, 2025

DeepSeek R1: X.com User Reviews

Deepseek-r1 is open source and on par with o1 preview - @bindureddy

LLMJanuary 25, 2025

Paper of DeepSeek-R1: Exploration and Breakthrough of the New Generation Inference Model

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

LLM

AlphaGo and the Power of Reinforcement Learning - Andrej Karpathy's Deep Dive on LLMs (Part 9)

DEEPMarch 21, 2025

Reinforcement Learning from Human Feedback (RLHF) - Andrej Karpathy's Deep Dive on LLMs (Part 10)

DEEPMarch 22, 2025

The Future of Large Language Models - Andrej Karpathy's In-Depth Explanation of LLM (Part 11)

DEEPMarch 23, 2025

DEEPSEEK

Learn about DeepSeek's innovative approaches to AI research and their contributions to the field.

GRPOMarch 4, 2025

GRPO (Group Relative Policy Optimization) Study Notes

We introduce Group Relative Policy Optimization (GRPO), a variant of Proximal Policy Optimization (PPO)

DEEPSEEKFebruary 28, 2025

DeepSeek #OpenSourceWeek - Five Consecutive Releases

We're a tiny team @deepseek_ai exploring AGI.

ANDREJFebruary 24, 2025

DeepSeek-R1 - Andrej Karpathy in-depth explanation of LLM (Part 9)

DeepSeek R1

LLMFebruary 19, 2025

How Different Large Models Like DeepSeek R1/ChatGPT o3/Grok3 Think

LLM Think

DEEPFebruary 11, 2025

Andrej Karpathy in-depth explanation of large language model (LLM) technology (Part 1) - [Pretraining and Inference]

- introduction - pretraining data (internet) - tokenization - neural network I/O - neural network internals - inference

DEEPSEEKJanuary 28, 2025

DeepSeek Janus Series: Unified Multimodal Understanding and Generation Models

Janus-Series: Unified Multimodal Understanding and Generation Models

DEEPSEEKJanuary 27, 2025

Comparison of the reasoning processes between ChatGPT o1 pro and DeepSeek R1

DeepSeek R1 Vs ChatGPT 01 (My Experience)

DEEPSEEKJanuary 26, 2025

DeepSeek R1: X.com User Reviews

Deepseek-r1 is open source and on par with o1 preview - @bindureddy

LLMJanuary 25, 2025

Paper of DeepSeek-R1: Exploration and Breakthrough of the New Generation Inference Model

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

AI TOOLS

LLM

AlphaGo and the Power of Reinforcement Learning - Andrej Karpathy's Deep Dive on LLMs (Part 9)

DEEPMarch 21, 2025

Reinforcement Learning from Human Feedback (RLHF) - Andrej Karpathy's Deep Dive on LLMs (Part 10)

DEEPMarch 22, 2025

The Future of Large Language Models - Andrej Karpathy's In-Depth Explanation of LLM (Part 11)

DEEPMarch 23, 2025

GOOGLE

Trial of Google's video generation model VOE2

GOOGLEMarch 23, 2025

Gemini 2.5 Pro, claimed to be far ahead of the competition, has been released with great fanfare: comprehensively surpassing other LLMs and topping the global rankings

LLMMarch 26, 2025

AI-Researcher: LLM-driven全自动 scientific research assistant

AIMarch 30, 2025

DEEPSEEK

GRPO (Group Relative Policy Optimization) Study Notes

DeepSeek #OpenSourceWeek - Five Consecutive Releases

DeepSeek-R1 - Andrej Karpathy in-depth explanation of LLM (Part 9)

How Different Large Models Like DeepSeek R1/ChatGPT o3/Grok3 Think

Andrej Karpathy in-depth explanation of large language model (LLM) technology (Part 1) - [Pretraining and Inference]

DeepSeek Janus Series: Unified Multimodal Understanding and Generation Models

Comparison of the reasoning processes between ChatGPT o1 pro and DeepSeek R1

DeepSeek R1: X.com User Reviews

Paper of DeepSeek-R1: Exploration and Breakthrough of the New Generation Inference Model

POPULAR

LLM

DEEPSEEK

GRPO (Group Relative Policy Optimization) Study Notes

DeepSeek #OpenSourceWeek - Five Consecutive Releases

DeepSeek-R1 - Andrej Karpathy in-depth explanation of LLM (Part 9)

How Different Large Models Like DeepSeek R1/ChatGPT o3/Grok3 Think

Andrej Karpathy in-depth explanation of large language model (LLM) technology (Part 1) - [Pretraining and Inference]

DeepSeek Janus Series: Unified Multimodal Understanding and Generation Models

Comparison of the reasoning processes between ChatGPT o1 pro and DeepSeek R1

DeepSeek R1: X.com User Reviews

Paper of DeepSeek-R1: Exploration and Breakthrough of the New Generation Inference Model

POPULAR

AI TOOLS

LLM

GOOGLE