Productionizing RAG: LLMs, embeddings, and cross-encoders | Dark Hacker News