Nick Weda

nar2189

AI & ML interests

None yet

Recent Activity

liked a dataset 16 days ago

google/jigsaw_toxicity_pred

liked a dataset 17 days ago

jasonkrone/real-toxicity-prompts-10k-sample

updated a dataset 17 days ago

nar2189/wizardlm7b-toxigen-baseline-eval

Organizations

upvoted 2 collections 23 days ago

ShareGPT Datasets

30 items • Updated Jul 27 • 14

Share-GPT

8 items • Updated Sep 15, 2024 • 1

upvoted an article 23 days ago

Uncensor any LLM with abliteration

Article • Jun 13, 2024 • 743

upvoted a paper 23 days ago

Shadow Alignment: The Ease of Subverting Safely-Aligned Language Models

Paper • 2310.02949 • Published Oct 4, 2023 • 3