OpenAI adds PDFChat feature to ChatGPT (twitter.com)
Admittedly, I don't know how the implementation works. I expected it to search the PDF for the relevant parts and answer from those, but my results have been really bad. It seems to make things up more often when I ask with a PDF attached than when I ask without a reference.
This PDF feature seems very useful, but there's no documentation on what it's doing under the hood or how best to use it.
There is practically no chance the new feature uses vision, because that would be _insanely_ slow and expensive for any reasonably sized document. They're likely using Azure's LayoutLM-derived tech to extract the text, then using embeddings to answer questions.
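To make the "extract text, then use embeddings" guess concrete, here is a minimal sketch of that kind of retrieval step. It is not OpenAI's implementation, which is not public. Everything in it is an assumption for illustration: the function names `chunk_text`, `embed`, `cosine`, and `top_chunks` are hypothetical, the chunk size is arbitrary, and `embed` is a toy bag-of-words count standing in for a real embedding model.

```python
import math
import re
from collections import Counter


def chunk_text(text, max_words=40):
    """Split already-extracted PDF text into fixed-size word windows.

    The window size is an arbitrary assumption, not a known setting.
    """
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def embed(text):
    """Toy stand-in for an embedding model: lowercase word counts."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_chunks(question, chunks, k=2):
    """Return the k chunks most similar to the question."""
    q = embed(question)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:k]


if __name__ == "__main__":
    chunks = [
        "The warranty lasts two years from purchase.",
        "Shipping takes five business days.",
        "Returns are accepted within thirty days.",
    ]
    print(top_chunks("How long is the warranty?", chunks, k=1))
```

In a pipeline like this, only the selected chunks would be passed to the model alongside the question. If the retrieval step picks the wrong chunks, the model never sees the relevant passage, which would be one possible explanation for answers that look made up.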