·
AI & ML interests
None yet
Organizations
datasets
18
jplhughes2/classify_alignment_faking_human_labels
Viewer
•
Updated
•
106
•
25
•
1
jplhughes2/docs_only_val_5k_filtered
Viewer
•
Updated
•
5k
•
25
jplhughes2/docs_only_30k_filtered
Viewer
•
Updated
•
30k
•
20
jplhughes2/alignment-faking-synthetic-chat-dataset-recall-0k-docs-8k-benign-2k-refusals
Viewer
•
Updated
•
10k
•
22
jplhughes2/alignment-faking-synthetic-chat-dataset-recall-0k-docs-20k-benign-10k-refusals
Viewer
•
Updated
•
29.4k
•
8
jplhughes2/alignment-faking-synthetic-chat-dataset-recall-5k-docs-8k-benign-2k-refusals
Viewer
•
Updated
•
15k
•
26
jplhughes2/alignment-faking-synthetic-chat-dataset-recall-5k-docs-4k-benign-1k-refusals
Viewer
•
Updated
•
10k
•
18
jplhughes2/alignment-faking-synthetic-chat-dataset-recall-10k-docs-8k-benign-2k-refusals
Viewer
•
Updated
•
20k
•
15
jplhughes2/alignment-faking-synthetic-chat-dataset-recall-30k-docs-0k-benign-0k-refusals
Viewer
•
Updated
•
30k
•
27
jplhughes2/alignment-faking-synthetic-chat-dataset-recall-20k-docs-0k-benign-0k-refusals
Viewer
•
Updated
•
20k
•
15