Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
bigsnarfdude
vincentoh
Follow
klebster's profile picture
1 follower
·
0 following
bigsnarfdude
AI & ML interests
None yet
Recent Activity
updated
a Space
11 days ago
vincentoh/why-split-personality
published
a Space
12 days ago
vincentoh/why-split-personality
updated
a Space
12 days ago
vincentoh/split-personality
View all activity
Organizations
None yet
vincentoh
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
updated
a Space
11 days ago
Running
Why Split Personality
📈
Experiments on AI sycophancy. Mech Interp exploration
published
a Space
12 days ago
Running
Why Split Personality
📈
Experiments on AI sycophancy. Mech Interp exploration
updated
a Space
12 days ago
Running
Split Personality
🏆
Mech Interp research on Attentional Hijacking
published
a Space
12 days ago
Running
Split Personality
🏆
Mech Interp research on Attentional Hijacking
updated
a dataset
about 1 month ago
vincentoh/sandbagging-agent-traces-v2
Viewer
•
Updated
about 1 month ago
•
2.79k
•
190
published
a dataset
about 1 month ago
vincentoh/sandbagging-agent-traces-v2
Viewer
•
Updated
about 1 month ago
•
2.79k
•
190
updated
a dataset
about 1 month ago
vincentoh/sandbagging-agent-traces
Viewer
•
Updated
Mar 22
•
3.19k
•
181
published
a dataset
about 1 month ago
vincentoh/sandbagging-agent-traces
Viewer
•
Updated
Mar 22
•
3.19k
•
181
updated
a dataset
about 2 months ago
vincentoh/persona-af-elicitation
Viewer
•
Updated
Mar 6
•
450
•
27
•
1
published
a dataset
about 2 months ago
vincentoh/persona-af-elicitation
Viewer
•
Updated
Mar 6
•
450
•
27
•
1
updated
a dataset
about 2 months ago
vincentoh/alignment-faking-v1.1
Updated
Feb 25
•
13
published
a dataset
about 2 months ago
vincentoh/alignment-faking-v1.1
Updated
Feb 25
•
13
updated
a dataset
3 months ago
vincentoh/alignment-faking-evaluation
Viewer
•
Updated
Feb 6
•
5.23k
•
21
published
a dataset
3 months ago
vincentoh/alignment-faking-evaluation
Viewer
•
Updated
Feb 6
•
5.23k
•
21
updated
a dataset
3 months ago
vincentoh/af-model-organisms
Updated
Jan 24
•
11
updated
a model
3 months ago
vincentoh/mistral-7b-af-organism
Text Generation
•
Updated
Jan 24
•
2
published
a model
3 months ago
vincentoh/mistral-7b-af-organism
Text Generation
•
Updated
Jan 24
•
2
updated
a model
3 months ago
vincentoh/gpt-oss-20b-af-detector
Text Generation
•
Updated
Jan 23
•
11
updated
a dataset
3 months ago
vincentoh/af-detection-benchmark
Updated
Jan 23
•
14
published
a dataset
3 months ago
vincentoh/af-model-organisms
Updated
Jan 24
•
11
Load more