Oceann Devlog

for my brain SSD

HOME
CATEGORIES
TAGS
ARCHIVES
ABOUT

Home Categories 스터디

Category

스터디 4

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Mar 12, 2025
Greedy Algorithm Dec 24, 2024
T5 논문 리뷰 Dec 13, 2024
LoRA 논문 리뷰 Oct 2, 2024

Recently Updated

[LLM의 RL] 2. 강화학습 알고리즘의 종류
Lecture 13. Normal Distribution
[LLM의 RL] 1. 강화학습의 기본 개념
Lecture 12. Discrete vs. Continuous, the Uniform
Lecture 11. The Poisson Distribution

Trending Tags

NLP statistics LLM Backend Django RL adk full stack MRC PEFT

© 2026 oceann. Some rights reserved.

Using the Chirpy theme for Jekyll.

Trending Tags

NLP statistics LLM Backend Django RL adk full stack MRC PEFT

A new version of content is available.