22 2

Nagayoshi Yamashita

nagayoshi3

AI & ML interests

None yet

Recent Activity

published a model about 1 month ago

nagayoshi3/lora_structeval_t_qwen3_4b_test7

updated a model about 1 month ago

nagayoshi3/lora_structeval_t_qwen3_4b_test7

published a model about 1 month ago

nagayoshi3/lora_structeval_t_qwen3_4b_test6

View all activity

Organizations

published a model about 1 month ago

nagayoshi3/lora_structeval_t_qwen3_4b_test7

Text Generation • Updated Feb 27 • 2

updated a model about 1 month ago

nagayoshi3/lora_structeval_t_qwen3_4b_test7

Text Generation • Updated Feb 27 • 2

published a model about 1 month ago

nagayoshi3/lora_structeval_t_qwen3_4b_test6

Text Generation • Updated Feb 27 • 240

updated 2 models about 1 month ago

nagayoshi3/lora_structeval_t_qwen3_4b_test6

Text Generation • Updated Feb 27 • 240

nagayoshi3/dpo-qwen-cot-merged_test1

Text Generation • 4B • Updated Feb 22 • 1

published a model about 1 month ago

nagayoshi3/dpo-qwen-cot-merged_test1

Text Generation • 4B • Updated Feb 22 • 1

upvoted a paper 10 months ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30, 2025 • 146

upvoted 13 papers 11 months ago

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published May 15, 2025 • 120

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22, 2025 • 445

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31, 2025 • 125

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published May 12, 2025 • 82

Survey on Evaluation of LLM-based Agents

Paper • 2503.16416 • Published Mar 20, 2025 • 96

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6, 2025 • 113

Nagayoshi Yamashita

AI & ML interests

Recent Activity

Organizations

nagayoshi3's activity