t55 | Dark Hacker News

user:	t55
created:	August 18, 2023
karma:	892
about:	ML researcher

1.	RL Speedrun(github.com) 2 points by t55 13 days ago \| 0 comments
2.	Target Policy Optimization(arxiv.org) 1 points by t55 77 days ago \| 0 comments
3.	Show HN: Kilroy – Knowledge base for teams using Claude Code(github.com) 5 points by t55 77 days ago \| 0 comments
4.	Procedural Reasoning Datasets(github.com) 1 points by t55 331 days ago \| 0 comments
5.	In Defence of Gary Marcus(reubenadams.substack.com) 3 points by t55 341 days ago \| 0 comments
6.	Reasoning Gym – Procedural RL reasoning datasets(github.com) 1 points by t55 343 days ago \| 0 comments
7.	ChatGPT Agent [video](youtube.com) 3 points by t55 350 days ago \| 0 comments
8.	ReasoningGym: Reasoning Environments for RL with Verifiable Rewards(arxiv.org) 105 points by t55 1 year ago \| 28 comments
9.	Show HN: Rehearsal.so, Duolingo for Public Speaking(rehearsal.so) 3 points by t55 1 year ago \| 1 comment
10.	End-to-End Vision Tokenizer Tuning(arxiv.org) 3 points by t55 1 year ago \| 0 comments
11.	YC Interview Mock Practice(rehearsal.so) 2 points by t55 1 year ago \| 0 comments
12.	D1: Scaling Reasoning in Diffusion LLMs via Reinforcement Learning(dllm-reasoning.github.io) 4 points by t55 1 year ago \| 0 comments
13.	Are LLMs more than autocomplete? AI Debate(rehearsal.so) 1 points by t55 1 year ago \| 0 comments
14.	Block Diffusion: Interpolating Autoregressive and Diffusion Language Models(m-arriola.com) 72 points by t55 1 year ago \| 16 comments
15.	How to stay in flow while using Cursor or Windsurf(rehearsal.so) 2 points by t55 1 year ago \| 0 comments
16.	Generative Modelling in Latent Space(sander.ai) 2 points by t55 1 year ago \| 0 comments