
Shaobai Jiang

shaobaij

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago
Neural Garbage Collection: Learning to Forget while Learning to Reason
upvoted a paper 2 days ago
Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence
upvoted a paper 2 days ago
Maximal Brain Damage Without Data or Optimization: Disrupting Neural Networks via Sign-Bit Flips

Organizations

None yet

New activity in openai/gpt-oss-120b 9 months ago

Fine tune 120b at 8 H100s getting cuda OOM error

👀 1
6
#117 opened 9 months ago by
jinxu88

FlashInfer requires sm75+

7
#48 opened 9 months ago by
hrithiksagar-bgen
New activity in mistralai/Mistral-7B-v0.1 over 2 years ago

If I trained a model on mistral already, do I need to start from scratch due to difficulties of fine-tuning?

2
#62 opened over 2 years ago by
brando

Cant run the model with the most basic code

👍 6
6
#7 opened over 2 years ago by
masterchop