veryluckyxyz | Dark Hacker News

user:	veryluckyxyz
created:	September 30, 2014
karma:	549

1.	Scaling Laws for Agent Harnesses via Effective Feedback Compute(arxiv.org) 1 points by veryluckyxyz 38 days ago \| 0 comments
2.	38 days ago \| discuss
3.	Generalizing Test-Time Compute-Optimal Scaling as an Optimizable Graph(huggingface.co) 2 points by veryluckyxyz 244 days ago \| 0 comments
4.	Hidden drivers of HRM's performance on ARC-AGI(arcprize.org) 31 points by veryluckyxyz 272 days ago \| 2 comments
5.	Set Block Decoding Is a Language Model Inference Accelerator(arxiv.org) 4 points by veryluckyxyz 301 days ago \| 0 comments
6.	Deep Think with Confidence(jiaweizzhao.github.io) 1 points by veryluckyxyz 316 days ago \| 0 comments
7.	A Batch Size and Token NUM- BER Agnostic Learning Rate Scheduler(arxiv.org) 2 points by veryluckyxyz 1 year ago \| 0 comments
8.	Easily Understand Rdma Technology(naddod.com) 1 points by veryluckyxyz 1 year ago \| 1 comment
9.	Model Merging in Pre-Training of Large Language Models(arxiv.org) 2 points by veryluckyxyz 1 year ago \| 0 comments
10.	Understanding Perception and Reasoning Through Model Merging(arxiv.org) 2 points by veryluckyxyz 1 year ago \| 0 comments
11.	Building and better understanding vision-language models (2024)(huggingface.co) 2 points by veryluckyxyz 1 year ago \| 0 comments
12.	HF smolagents computer-agent demo(huggingface.co) 1 points by veryluckyxyz 1 year ago \| 0 comments
13.	Do Reasoning Models Show Better Verbalized Calibration?(arxiv.org) 2 points by veryluckyxyz 1 year ago \| 0 comments
14.	Robustly identifying concepts introduced during chat fine-tuning with crosscoder(arxiv.org) 6 points by veryluckyxyz 1 year ago \| 0 comments
15.	Retrieval with Learned Similarities(arxiv.org) 3 points by veryluckyxyz 1 year ago \| 0 comments
16.	The Curse of Depth in Large Language Models(arxiv.org) 1 points by veryluckyxyz 1 year ago \| 0 comments
17.	Looking Back at Speculative Decoding(research.google) 36 points by veryluckyxyz 1 year ago \| 5 comments
18.	Long-Context GRPO(unsloth.ai) 60 points by veryluckyxyz 1 year ago \| 22 comments
19.	HippoRAG: Neurobiologically Inspired Long-Term Memory for LLMs (2024)(arxiv.org) 65 points by veryluckyxyz 1 year ago \| 4 comments
20.	Learning to Plan and Reason for Evaluation with Thinking-LLM-as-a-Judge(arxiv.org) 1 points by veryluckyxyz 1 year ago \| 0 comments
21.	Process Reinforcement Through Implicit Rewards(curvy-check-498.notion.site) 1 points by veryluckyxyz 1 year ago \| 0 comments
22.	Explaining Large Language Models Decisions Using Shapley Values(arxiv.org) 89 points by veryluckyxyz 1 year ago \| 19 comments
23.	Phi-4 Technical Report(arxiv.org) 2 points by veryluckyxyz 1 year ago \| 0 comments
24.	Alignment Faking in LLMs [pdf](assets.anthropic.com) 2 points by veryluckyxyz 1 year ago \| 1 comment
25.	What Makes Rotary Positional Encodings Useful?(arxiv.org) 1 points by veryluckyxyz 1 year ago \| 0 comments
26.	Rethinking Softmax: Self-Attention with Polynomial Activations(arxiv.org) 2 points by veryluckyxyz 1 year ago \| 0 comments
27.	Post-Training Layer Scaling Prevents Forgetting and Enhances Model Merging(arxiv.org) 1 points by veryluckyxyz 1 year ago \| 0 comments