LogoInfiaiblog
LatestAgentOpenAILLMAbout

DEEPSEEK

GRPO (Group Relative Policy Optimization) Study Notes
GRPOMarch 4, 2025

GRPO (Group Relative Policy Optimization) Study Notes

We introduce Group Relative Policy Optimization (GRPO), a variant of Proximal Policy Optimization (PPO)

DeepSeek #OpenSourceWeek - Five Consecutive Releases
DEEPSEEKFebruary 28, 2025

DeepSeek #OpenSourceWeek - Five Consecutive Releases

We're a tiny team @deepseek_ai exploring AGI.

DeepSeek-R1 - Andrej Karpathy in-depth explanation of LLM (Part 9)
ANDREJFebruary 24, 2025

DeepSeek-R1 - Andrej Karpathy in-depth explanation of LLM (Part 9)

DeepSeek R1

How Different Large Models Like DeepSeek R1/ChatGPT o3/Grok3 Think
LLMFebruary 19, 2025

How Different Large Models Like DeepSeek R1/ChatGPT o3/Grok3 Think

LLM Think

Andrej Karpathy in-depth explanation of large language model (LLM) technology (Part 1) - [Pretraining and Inference]
DEEPFebruary 11, 2025

Andrej Karpathy in-depth explanation of large language model (LLM) technology (Part 1) - [Pretraining and Inference]

- introduction - pretraining data (internet) - tokenization - neural network I/O - neural network internals - inference

DeepSeek Janus Series: Unified Multimodal Understanding and Generation Models
DEEPSEEKJanuary 28, 2025

DeepSeek Janus Series: Unified Multimodal Understanding and Generation Models

Janus-Series: Unified Multimodal Understanding and Generation Models

Comparison of the reasoning processes between ChatGPT o1 pro and DeepSeek R1
DEEPSEEKJanuary 27, 2025

Comparison of the reasoning processes between ChatGPT o1 pro and DeepSeek R1

DeepSeek R1 Vs ChatGPT 01 (My Experience)

DeepSeek R1: X.com User Reviews
DEEPSEEKJanuary 26, 2025

DeepSeek R1: X.com User Reviews

Deepseek-r1 is open source and on par with o1 preview - @bindureddy

Paper of DeepSeek-R1: Exploration and Breakthrough of the New Generation Inference Model
LLMJanuary 25, 2025

Paper of DeepSeek-R1: Exploration and Breakthrough of the New Generation Inference Model

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

POPULAR

AI

Rhubarb Lip Sync - AI-generated lip animation for 2D characters

AI

AI Grant Project List - Batch 1

AI

Playing Werewolf with LLM Agents

AI

Playing Werewolf with LLM Agents (Continued)

AI

Kuaishou's LivePortrait - A Video-driven Avatar Animation Framework

LLM

See More

AlphaGo and the Power of Reinforcement Learning - Andrej Karpathy's Deep Dive on LLMs (Part 9)

DEEPMarch 21, 2025

Reinforcement Learning from Human Feedback (RLHF) - Andrej Karpathy's Deep Dive on LLMs (Part 10)

DEEPMarch 22, 2025

The Future of Large Language Models - Andrej Karpathy's In-Depth Explanation of LLM (Part 11)

DEEPMarch 23, 2025

DEEPSEEK

Learn about DeepSeek's innovative approaches to AI research and their contributions to the field.

GRPO (Group Relative Policy Optimization) Study Notes
GRPOMarch 4, 2025

GRPO (Group Relative Policy Optimization) Study Notes

We introduce Group Relative Policy Optimization (GRPO), a variant of Proximal Policy Optimization (PPO)

DeepSeek #OpenSourceWeek - Five Consecutive Releases
DEEPSEEKFebruary 28, 2025

DeepSeek #OpenSourceWeek - Five Consecutive Releases

We're a tiny team @deepseek_ai exploring AGI.

DeepSeek-R1 - Andrej Karpathy in-depth explanation of LLM (Part 9)
ANDREJFebruary 24, 2025

DeepSeek-R1 - Andrej Karpathy in-depth explanation of LLM (Part 9)

DeepSeek R1

How Different Large Models Like DeepSeek R1/ChatGPT o3/Grok3 Think
LLMFebruary 19, 2025

How Different Large Models Like DeepSeek R1/ChatGPT o3/Grok3 Think

LLM Think

Andrej Karpathy in-depth explanation of large language model (LLM) technology (Part 1) - [Pretraining and Inference]
DEEPFebruary 11, 2025

Andrej Karpathy in-depth explanation of large language model (LLM) technology (Part 1) - [Pretraining and Inference]

- introduction - pretraining data (internet) - tokenization - neural network I/O - neural network internals - inference

DeepSeek Janus Series: Unified Multimodal Understanding and Generation Models
DEEPSEEKJanuary 28, 2025

DeepSeek Janus Series: Unified Multimodal Understanding and Generation Models

Janus-Series: Unified Multimodal Understanding and Generation Models

Comparison of the reasoning processes between ChatGPT o1 pro and DeepSeek R1
DEEPSEEKJanuary 27, 2025

Comparison of the reasoning processes between ChatGPT o1 pro and DeepSeek R1

DeepSeek R1 Vs ChatGPT 01 (My Experience)

DeepSeek R1: X.com User Reviews
DEEPSEEKJanuary 26, 2025

DeepSeek R1: X.com User Reviews

Deepseek-r1 is open source and on par with o1 preview - @bindureddy

Paper of DeepSeek-R1: Exploration and Breakthrough of the New Generation Inference Model
LLMJanuary 25, 2025

Paper of DeepSeek-R1: Exploration and Breakthrough of the New Generation Inference Model

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

POPULAR

AI

Rhubarb Lip Sync - AI-generated lip animation for 2D characters

AI

AI Grant Project List - Batch 1

AI

Playing Werewolf with LLM Agents

AI

Playing Werewolf with LLM Agents (Continued)

AI

Kuaishou's LivePortrait - A Video-driven Avatar Animation Framework

AI TOOLS

ChatGPTGeminiDeepSeekGrokElevenLabsClaude

LLM

See More

AlphaGo and the Power of Reinforcement Learning - Andrej Karpathy's Deep Dive on LLMs (Part 9)

DEEPMarch 21, 2025

Reinforcement Learning from Human Feedback (RLHF) - Andrej Karpathy's Deep Dive on LLMs (Part 10)

DEEPMarch 22, 2025

The Future of Large Language Models - Andrej Karpathy's In-Depth Explanation of LLM (Part 11)

DEEPMarch 23, 2025

GOOGLE

See More

Trial of Google's video generation model VOE2

GOOGLEMarch 23, 2025

Gemini 2.5 Pro, claimed to be far ahead of the competition, has been released with great fanfare: comprehensively surpassing other LLMs and topping the global rankings

LLMMarch 26, 2025

AI-Researcher: LLM-driven全自动 scientific research assistant

AIMarch 30, 2025

SUBSCRIBE

All our premium content and latest news delivered straight to your inbox

INFIAIBLOG

LATESTAGENTOPENAILLMGOOGLENVIDIADEEPSEEKOCRCHATGPTGENERATORCLAUDEABOUT

© 2024 Infiaiblog. ALL RIGHTS RESERVED

INFIAIBLOG

© 2024 Infiaiblog. ALL RIGHTS RESERVED.

LATESTAGENTOPENAILLM
GOOGLENVIDIADEEPSEEKOCR
CHATGPTGENERATORCLAUDEABOUT