arxiv:2505.12929
Zhihe Yang
zhyang2226
AI & ML interests
Trustworthy RL & Offline RL
Recent Activity
upvoted a paper about 22 hours ago
Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization liked
a model 2 months ago
NbAiLabArchive/whisper-large-v2-nob liked
a model 5 months ago
tencent/HunyuanImage-3.0