Oceann Devlog

for my brain SSD

HOME
CATEGORIES
TAGS
ARCHIVES
ABOUT

Home Tags DeepSeek

Tag

DeepSeek 1

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Mar 12, 2025

Recently Updated

[LLM의 RL] 2. 강화학습 알고리즘의 종류
Lecture 13. Normal Distribution
[LLM의 RL] 1. 강화학습의 기본 개념
Lecture 12. Discrete vs. Continuous, the Uniform
Lecture 11. The Poisson Distribution

Trending Tags

NLP statistics LLM Backend Django RL adk full stack MRC PEFT

© 2026 oceann. Some rights reserved.

Using the Chirpy theme for Jekyll.

Trending Tags

NLP statistics LLM Backend Django RL adk full stack MRC PEFT

A new version of content is available.