| user: | typpo |
| created: | February 25, 2011 |
| karma: | 3.8k |
| about: | https://www.ianww.com email: hn at ianww.com |
| 1. | How to replicate the Claude Code attack with Promptfoo (promptfoo.dev) |
| 2. | Questions censored by DeepSeek (promptfoo.dev) |
| 3. | Llama 3.2 (huggingface.co) |
| 4. | Automated jailbreaking techniques with DALL-E (promptfoo.dev) |
| 5. | Show HN: Automated red teaming for your LLM app (promptfoo.dev) |
| 6. | Benchmark Command R vs. GPT/Claude on your own data (promptfoo.dev) |
| 7. | DBRX vs. Mixtral vs. GPT: create your own benchmark (promptfoo.dev) |
| 8. | How to benchmark Gemini vs. GPT with your own data (promptfoo.dev) |
| 9. | A collection of LLM evaluation tools (ianww.com) |