Goodreads plans to retire API access, disables existing API keys

Goodreads plans to retire API access, disables existing API keys(joealcorn.co.uk)

869 points by buttscicles 5 years ago | 422 comments

kashyapc 5 years ago |

I recently discovered the https://openlibrary.org/ by The Internet Archive. On the face of it, their "about" page[1] sounds appealing (not least because it resonates with my open source values):

One web page for every book ever published. It's a lofty but achievable goal.

To build Open Library, we need hundreds of millions of book records, a wiki interface, and lots of people who are willing to contribute their time and effort to building the site.

To date, we have gathered over 20 million records from a variety of large catalogs as well as single contributions, with more on the way.

Open Library is an open project: the software is open, the data are open, the documentation is open, and we welcome your contribution. Whether you fix a typo, add a book, or write a widget--it's all welcome. We have a small team of fantastic programmers who have accomplished a lot, but we can't do it alone!

---

They also seem to provide an API[2].

[1] https://openlibrary.org/about

[2] https://openlibrary.org/developers/api

mekarpeles 5 years ago | |

For anyone who wishes Open Library was even better, please join one of our weekly community calls @ 11:30am Pacific.

For an invite, please send me an email at mek@archive.org or go to: https://openlibrary.org/volunteer

# APIs & Data Dumps

- https://openlibrary.org/developers/api

- https://openlibrary.org/dev/docs/api/books

- https://openlibrary.org/developers/dumps monthly data dumps for if you need bulk access and the APIs are not enough.

# Spread the word

Also, if you want to help raise awareness of this resource, please help us get the word out on twitter!

1. https://twitter.com/openlibrary/status/1338185940469051392

2. https://twitter.com/openlibrary/status/1338186553915367425

# Issues

Thank you all for helping us discover some issues with our goodreads importer and search (recently migrated to Python3 + thanks @cdrini et al for these fast bug fixes! If you notice an problem, please help open an issue here: https://github.com/internetarchive/openlibrary/issues/new/ch...

# Learn More

- https://archive.org/details/openlibrary-tour-2020/openlibrar... if you want to learn more about Open Library, here's a short intro vid.

- https://github.com/internetarchive/openlibrary if you want to follow on github.

rikroots 5 years ago | | |

Hi, Mek. Awesome project!

How do I go about claiming my author page? The current book listed there has been officially "retired" for over a decade now, and I have plenty of other books that I'd be happy to add.

https://openlibrary.org/authors/OL2965893A/Rik_Roots

bloak 5 years ago | | |

I noticed that two authors with the same name are conflated. But if I try to edit any of the editions involved, the interface won't let me either modify or delete an existing author nor even add a new author using the author ID rather than the author's name. What should I do to sort this out?

joppy 5 years ago | | |

It just struck me how dubious the term “Pacific time” is - a timezone named after the largest ocean on Earth. I’m in Australia on the side which is also on the Pacific, but my time is 6 or more hours away from “Pacific time” :P

philipn 5 years ago | |

Aaron Swartz actually built the original version of the Open Library site.

neojumi 5 years ago | | |

The regime responsible for his prosecution is back. Maybe they will pick up where they left off too.

jabo 5 years ago | |

Openlibrary looks pretty awesome. Thank you for sharing!

Would anyone be interested in having an instant search experience for this books dataset, like the one I built for the 2M recipes database posted on HN earlier this week: https://news.ycombinator.com/item?id=25365397

mekarpeles 5 years ago | | |

Jabo, please also help us @ openlibrary improve our search. @cdrini is the lead on our solr efforts and we could really benefit from teaming with someone who is really passionate about search. If you have questions about using our data, please send over a message and I'm happy to help: mek@archive.org

jabo 5 years ago | | |

Alright! Just pushed the instant search app with the Open Library database live: https://news.ycombinator.com/item?id=25414389

obviyus 5 years ago | | |

That would be amazing!

gravitas 5 years ago | |

Sadly, the Goodreads importer appears broken - a fresh export just now of my Goodreads data (<100k) is failing to import with a generic "oops it failed" error almost immediately. :(

[1] https://www.goodreads.com/review/import (export)

[2] https://openlibrary.org/account/import/goodreads

cdrini 5 years ago | | |

Hello! I work on Open Library; sorry for the bug! We recently deployed a big Python 3 migration that stirred the pot a little. The import issue should now be fixed: https://github.com/internetarchive/openlibrary/pull/4259

gwern 5 years ago | | |

The GR CSV export worked for me just now. Possibly an overload problem. Regardless, the writing is now on the wall for GR - get out while you still can.

simonklitj 5 years ago | | |

Not just you, gives me the same error :(

activatedgeek 5 years ago | |

Since the past few months I have been searching for a Goodreads alternative. Something that only keeps my books. I don't care about the social features that much. And I think this is it. I am going to donate a tiny bit right away!

Although, I just tried importing my Goodreads export into Open Library and I get the following "Internal Error":

> Hmm... > Sorry. There seems to be a problem with what you were just looking at. > We've noted the error xxxx-xx-xx/yyyyyy and will look into it as soon as possible. Head for home?

Anyone else facing this issue?

cdrini 5 years ago | | |

Hi! I work on Open Library; sorry for the issue! We recently had a big python 3 migration. Chris/Aaron just fixed + tested it, and I just deployed it to production, so it should be working now!

https://github.com/internetarchive/openlibrary/pull/4259

AdmiralAsshat 5 years ago | | |

Have you tried Library Thing?

https://www.librarything.com/

fm4d 5 years ago | |

Its nice that its opensource, backed by Internet Archive and the Controled Digital Lending program is cool too, but how is it possible that a project 14 years in development is such a mess? Just try and search for some popular books and see for yourself, the most important feature - search for books well, is not present. Basic features are missing, book data is often wrong, etc... honestly why would I join such a project instead of starting a new one?

cdrini 5 years ago | | |

Hi! I work on Open Library. The project is entirely open source, with an active community, so anyone can contribute fixes/features on GitHub: https://github.com/internetarchive/openlibrary

And yeah, searching needs some work! That's on my task list for this month. Just this Friday I spent most of my day working on updating our search engine, Solr, from 3.6 to 8.7 (wip!). But search is a _BIG_ pain point. We're a small team with a big long list of things to do, but we are making progress! This year we updated to Python 3, switched most of our production environments to docker-based for easier deploys and to give open source contributors more control of production infra, added reading history stats for users, added a new interface for exploring books, worked on a novel recommendation system, added text selection to the online BookReader for public domain books, added GoodReads importing, grew our community, added the ability to search by classification, and much, much more (you can see highlights from our year here: https://github.com/internetarchive/openlibrary/issues/3891 ).

There is still _definitely_ a lot to do, but I think the biggest reason worth using/contributing to Open Library is likely its open source community. Anyone can jump in and help make improvements to the system (as they very often do!). Personally, I think it's more likely that a system with a community will survive/flourish than one maintained by a single person (I also wondered whether I should just create my own before contributing to and now working on Open Library!). And there are also loads of different tasks associated with a site like OL, which would be impossible for me to do if I was going it alone.

If you would be interested, checkout the GitHub repo: https://github.com/internetarchive/openlibrary . It's very active, and you can get an idea of how we work :)

traverseda 5 years ago | | |

This book is actually a candle, I'm pretty sure: https://openlibrary.org/books/OL28314296M/Harry_Potter

How do you even get an ISBN for a candle?

Funes- 5 years ago | |

Upon visiting the Open Library, I'm greeted by a banner covering the top half of the screen, asking me for a donation to keep up with bandwidth costs. Isn't this platform, as well as the Internet Archive or Wikipedia, exactly of the kind that would benefit from being built on top of some kind of P2P network? Content is generated and maintained collectively; why isn't infrastructure treated the same way?

mekarpeles 5 years ago | | |

Hi Funes,

Great points here

1. The banner happens last month of the year (Wikipedia being the perfect analog). Yes, there are mixed feelings and it's not the world's best experience :P

2. Our entire data set is available to download as in bulk https://openlibrary.org/developers/dumps because we'd love to see a decentralized p2p version

3. https://github.com/mouse-reeve/bookwyrm Mouse who used to work @ Internet Archive has a decentralized version of Open Library (Bookwyrm) and it's worth checking out.

4. For the last 5 or so years the Internet Archive has been cultivating a dweb/dapp community and integrating with IIIF, Dat, IPFS, gun, bittorent, webtorrents, and others and hosting regular summits and meetups https://blog.archive.org/2018/07/21/decentralized-web-faq/

5. The wayback machine is an interesting case study: it turns out, incentive structures (even things like FIL/filecoin) haven't been able to perfectly crack the nut on getting folks interested enough to preserve the whole wayback machine. There's petabytes of material and there's a powerlaw about what people care about today. Internet Archive realized what we care about today may not be the same as tomorrow, and so there's a cost eaten (the incentive comes from economies of scale generated by intrinsic desire rather than $). And in a way, this centralized solution (economies of scale) IS the solution a community came up with. It has flaws and advantages (tradeoffs), such as centralized points of failure, and I think the archive would be (and has been) ecstatic to explore improving these opportunities.

neojumi 5 years ago | | |

IPFS Comes to mind.

khalilravanna 5 years ago | |

Anyone know of a project like this for video games? I got a spreadsheet I use to organize games I’ve played/am going to play and always looking for an easier way to get metadata. I was also looking at building something like Goodreads for video games and similarly that data would have been great.

WorldMaker 5 years ago | | |

What GOG is attempting with Galaxy 2.0 [0] is interesting in this department. Galaxy 2.0 has plugins to import what it can of games/achievements/play time from as many other vendors as it can and tries to show something of a view of every game across all your major services/accounts. Many of those plugins are open source, written in Python, though of course Galaxy 2.0 itself is not itself open source and it is still the primary installer/launcher for GOG's own service, if you are concerned about vendor control. There's a third party export script in Python as well [1], though it is reading JSON and SQLite files directly and not using an official API.

[0] https://www.gog.com/galaxy

[1] https://github.com/AB1908/GOG-Galaxy-Export-Script

Pet_Ant 5 years ago | | |

What about https://videogamegeek.com/ ?

Xavdidtheshadow 5 years ago | | |

https://www.igdb.com/ is the one! I do the same thing. I've got a great process built on Airtable + IGDB.

corobo 5 years ago | | |

Depends on your use case but yes: https://www.giantbomb.com/api/

desertcroc 5 years ago | | |

Have a look at grouvee. I've been using it for quite a while now and I believe is basically donationware.

mekarpeles 5 years ago | |

Thank you buttscicles (hard saying that with a straight face) for OP'ing this thread and to Joe Alcorn for the amazing original article.

I haven't shared this yet -- it's more for the community, but I've tried to address various questions from the community and distill answers + resources for Open Library here:

https://blog.openlibrary.org/2020/12/13/importing-your-goodr...

Tried my best to include others players in the space (wikidata, inventaire, bookbrainz, worldcat, bookwyrm) who are doing great work and pay respects to readng, storygraph and other innovative services which are breaking onto the scene.

bbkane 5 years ago | |

I just signed up for this and imported my GoodReads csv export. the csv has 90ish rows and I was only able to import 60ish rows.

I get that Open Library doesn't have as much data as GoodReads, but I wish it would show me the data it couldn't import so I could add it manually to Open Library's data store.

Nevertheless, I love the idea and I'll be opening bug reports and maybe code contributions if something looks easy enough.

mekarpeles 5 years ago | | |

bbkane, this is a great idea (identifying which books didn't import). If you'd be so kind as to help, please open a feature request for this!

https://github.com/internetarchive/openlibrary/issues/new/ch...

If you tag @tabshaikh who helped implement the importer and me @mekarpeles we can make sure it gets triaged and tagged correctly this week :)

bborud 5 years ago | |

This is actually very cool.

Dissatisfied with how slow and clunky Goodreads is I actually thought about making my own (albeit much simpler) version of Goodreads to keep track of my reading habits. I often dig through Goodreads to find books or authors I can't remember the names of -- and Goodreads isn't great for that.

Open Library actually provides the missing piece. The fact that they offer bulk downloads also makes it easier to be a good internet citizen and not send tons of API traffic their way.

Looks like I'll have to set up a monthly donation. I'd really like see openlibrary succeed.

cdrini 5 years ago | |

Hi! I work on Open Library. Yep, Open Library has public APIs, and data dumps (updated monthly) of all our books/authors if anyone needs them.

https://openlibrary.org/developers/dumps

The project is also open source, and you can find the code (and contribute!) on GitHub: https://github.com/internetarchive/openlibrary

acomjean 5 years ago | |

Doesn’t the library of Congress in the US issue ISBN numbers for each book published. there must be a public listing of those.

After some looking, there are Some private databases with millions of # but no official site. Eg

https://isbndb.com/isbn-database

pmyteh 5 years ago | | |

No. ISBNs are issued by publishers, from delegated blocks, and there's no unified listing.

For books in the collections of large libraries (like the LoC) there will be a public catalogue entry with the ISBN attached, but they don't assign it.

There were also a lot of books published before ISBNs were created, and not every book has an ISBN attached even to this day.

toddh 5 years ago | | |

On amazon you can get an ASIN so you don't have to buy an ISBN. So there's no universal book identifier.

drusepth 5 years ago | |

I built a private Goodreads "competitor" for friends (with groups and book clubs and whatnot) using the Open Library data dumps for book/author/publisher data (since GR APIs were too restrictive in how they could be used). They're great, easy to use, and the site behind them looks like it's run well and stable (edit: didn't realize they're under Internet Archive!). Would definitely recommend them as an alternative.

throw0101a 5 years ago | |

See also WorldCat:

> WorldCat is a union catalog that itemizes the collections of 17,900 libraries in 123 countries and territories[4] that participate in the OCLC global cooperative. It is operated by OCLC, Inc.[5] The subscribing member libraries collectively maintain WorldCat's database, the world's largest bibliographic database.[6]

* https://en.wikipedia.org/wiki/WorldCat

* https://www.worldcat.org

sohkamyung 5 years ago | |

I'm looking for a site that supports not only books but short stories.

My primary interest is in recording my thoughts on books and stories I've read, so a review site is what I'm looking for. This is more challenging for short stories as they don't have a ISBN, may appear online, or in a fiction magazine or in an anthology.

So, I may put down my thoughts on short story X that I read in book Y, but if I look up book Z that also features that short story, my thoughts on the story would also appear there.

In short, I'm looking for a site that also records short stories like the ISFDB [1], but allows users to add reviews.

So far, I haven't found one. I'm now putting down my notes on short stories in a Zotero database.

[1] http://isfdb.org/

markdown 5 years ago | |

Do you know how to filter a search on openlibrary by books that are in the library? It's annoying to search and get hundreds of "Not in Library" results.

dredmorbius 5 years ago | |

The "one page for every book" goal seems to position Open Library as a rival to OCLC (Worldcat), as well as the almost perfectly useless Hathi Trust.

einpoklum 5 years ago | |

Should this be a separate HN post?

numair 5 years ago |

This makes absolutely no sense and has no relation to any economic variables. Goodreads isn’t some struggling self-funded startup — it’s owned by Amazon.com. The acquisition was a deal that should have never been approved, if the Obama administration had been anything beyond completely impotent at protecting us from monopoly games:

https://www.theguardian.com/books/2013/apr/02/amazon-purchas...

I would like to understand the true strategic interest behind this. Is Amazon simply penny-pinching now that they’ve successfully obliterated the market for both new and used books online? There’s way more to this story than appears on the surface.

dash2 5 years ago |

Key para:

The web has to mature beyond advertising as a business model. For this to happen people are going to have to open their wallets, pay for the services they use, and support independent businesses. That’s how we build a web where indies can thrive - one that’s more village centre than financial centre. I think the shift is underway.

True/false?

captn3m0 5 years ago |

>So this is an “announcement” much in the way a windshield announces its presence to bugs on a highway

This is a very poetic way to describe API deprecation. I'm gonna steal this.

chrismorgan 5 years ago | |

Except this isn’t so much deprecation as removal. Deprecation says “we don’t recommend this any more, and it’ll probably be removed in the future”. Deprecation would be quite unlike the windshield analogy, because it is an announcement.

(It’s not quite cut and dried in this case because there may still be some people that still have access to the API—but those that have been cut off look to have no recourse.)

ignoramous 5 years ago | | |

> Except this isn’t so much deprecation as removal. Deprecation says “we don’t recommend this any more, and it’ll probably be removed in the future”.

At AWS, I was "put on a pedestal" for stating (in an internal forum) that X was deprecated (I meant it in a sense it was "not recommended anymore" and wasn't updated at all)... The management thought it sent the wrong message (that is, deprecation == removal).

People often associate wrong meaning to deprecation (ironic in my case given Amazon is a Java shop).

booleandilemma 5 years ago | |

And just like one must scrape bugs off one’s windshield, so people will have to scrape content off Goodreads, or something.

swilk001 5 years ago |

My whole app depends on the Goodreads API so I have to shut it down.

https://blog.stephanieawilkinson.com/posts/2020-12-10-yonder...

vuciv1 5 years ago | |

I'm sorry. that's awful. if you plan to start a new one, let me know I'd love to check it out.

i was just planning on making one, but that dream is dead now

swilk001 5 years ago | | |

ok will do! do you have a twitter handle? i never use HN :-D

I'm twitter.com/stephanieblack

judge2020 5 years ago | |

A lot of others in this thread have mentioned the API being neglected for a long time, did this impact your app?

swilk001 5 years ago | | |

There was no support for devs, but the bones were good and they didn't make any changes, so I managed to make it work.

I used both the OAuth and Shelves APIs.

gravitas 5 years ago |

LibraryThing has a Goodreads importer which I have used in the past with good success (nothing is perfect).

https://www.librarything.com/more/importgoodreads

The general LibraryThing UI is a bit scary at first, don't let that stop you. All "Pro" accounts were made free for everyone some years back.

milofeynman 5 years ago | |

I also use library things, and thanks to their change to their business model their free accounts work great for tracking my home library and what I read. I also like their app for scanning barcodes into my library.

mindracer 5 years ago | |

Amazon owns a 40% stake in LibraryThing which I only found out after moving to it

dpeck 5 years ago |

Remember those few years from around 2008 to 2013 or so when open APIs were cool, mashups of different services were everywhere, the web felt just a bit younger and more carefree, and people didn’t immediately sneer when they learned that you were a software developer?

Those were good times.

spideymans 5 years ago | |

>people didn’t immediately sneer when they learned that you were a software developer?

People sneer now?

dpeck 5 years ago | | |

There’s quite a bit of animosity directed at various tech companies from people all over the political spectrum. That is definitely bleeding over into the professional generally.

satyanash 5 years ago |

Note that you can still export a CSV of your books, (although this is not all the data that is present).

Here https://www.goodreads.com/review/import

gravitas 5 years ago | |

It's still a pretty good backup which I've imported to other services before, here's the CSV header row for those interested:

Book Id,Title,Author,Author l-f,Additional Authors,ISBN,ISBN13,My Rating,Average Rating,Publisher,Binding,Number of Pages,Year Published,Original Publication Year,Date Read,Date Added,Bookshelves,Bookshelves with positions,Exclusive Shelf,My Review,Spoiler,Private Notes,Read Count,Recommended For,Recommended By,Owned Copies,Original Purchase Date,Original Purchase Location,Condition,Condition Description,BCID

The CSV export uses both quoted and unquoted fields at the same time on the same record which is unfortunate, but it works.

a_bonobo 5 years ago | | |

Yeah, the Python csv package hasn't had problems for me yet.

One unfortunate bug that they seem to have put onto the 'wont-fix' pile is that for many recent-ish books, the 'date read' field isn't properly exported, so if you try to make reading stats you have to cheat a bit by approximating the 'finished date' with the 'book added' date.

aminozuur 5 years ago |

Goodreads has not changed in 10 years. See comparison pic of 2010 vs 2020: https://twitter.com/aminozuur/status/1338037049941757953

It's sad that Goodreads way forward is to stifle competition, rather than innovate.

rdl 5 years ago |

Goodreads has just generally been "sad" for a very long time. Even before the Amazon acquisition, the site has been slow, hasn't actually innovated in anything, and basically is the equivalent of a late-90s/early-00s craigslist -- a mediocre but "good enough" service which squats on a market preventing better competitors from existing.

Merman_Mike 5 years ago | |

It kills me to see because Amazon is the entity that could build the best book recommendation engine of all time. And it would help them sell more books!

Siira 5 years ago | | |

What ideas do you have? I find Goodreads has the features, but its UX is rather old-fashioned.

nefitty 5 years ago |

The LibraryThing API might be an approximate replacement, although I’m not sure what the major differences are: http://www.librarything.com/services/

hirako2000 5 years ago | |

Interesting. They could do a better UI, mobile responsive, but their db is pretty big.

iou 5 years ago |

That site they're building looks dope https://beta.readng.co

podviaznikov 5 years ago |

I made a tool[1] a year ago to export Goodreads reviews into markdown and synced it Dropbox. Wanted to have two way sync, but I guess not anymore.

[1] - https://borges.ai

miguelrochefort 5 years ago |

Based on the quality of MusicBrainz [1], I thought that BookBrainz [2] could be a good alternative, but unfortunately it looks rather incomplete.

[1] https://musicbrainz.org [2] https://bookbrainz.org/

asplake 5 years ago |

What kind of API does/will Readng have? Bookseller integration?

I have a use case: the bibliographies (recommended reading pages) for my own books. Could I send readers to their choice of bookseller?

buttscicles 5 years ago | |

Hi, author here!

Would love to have a public API for readng but don't want supporting it slowing us down when we need to change something. It's just the three of us in our spare time at the moment, so we need the agility!

I really, really like that use case too! Right now all you could do would be to create a collection of books on your profile, but would be nice to have on the book page I think.

> Could I send readers to their choice of bookseller?

We would like to refer people to libraries and indies, but these are obviously fragmented so a bit difficult. I think allowing authors to set a preferred book seller probably makes sense.

smarx007 5 years ago | | |

Could you please at least create an export button under https://beta.readng.co/settings so that we can migrate to readng.co without having similar concerns that made us to sign up for it today? And ofc it has to be machine-readable, e.g. JSON or XML but CSV should work too (for CSV ideally it would be in the same format as the one from GR to reduce fragmentation in this space). This is also a must under GDPR, so not going to be an effort wasted.

Cenk 5 years ago | |

https://boook.link is a useful tool for offering people links to different stores

asplake 5 years ago | | |

Thank you!

_iyig 5 years ago |

Sad but not surprising that Amazon let Goodreads moulder. They did the exact same thing with Shelfari [0], formerly Goodread's main competitor.

[0] https://en.wikipedia.org/wiki/Shelfari#Amazon_and_shutdown

iou 5 years ago |

Roadmap for that reading app is here https://readng.nolt.io/roadmap

Personally I'd like to see audible integration, which ironically goodreads does not provide :(

padraigf 5 years ago |

I'd just started a hobby project to use the Goodreads API for book recommendations. That's not going to happen now.

I feel bad for people who put more work than I did into using the API. It seems kind of short-sighted to me, I'd have thought anything promoting book-reading would be good for Amazon/goodreads.

Hopefully it's the catalyst for a good alternative to appear. I know developers weren't happy with the API, or users with the site in general.

hoyd 5 years ago |

One of the very first programs I wrote, was a python script that used this API. I enjoyed the learning and what I could do with it. This is indeed very sad.

throwanem 5 years ago |

Tangential, but I've always been curious. What's the use of a public "what I'm reading/have read/will read" in the first place? Goodreads has users so I guess it has some value, and I do occasionally look at reviews there, but I'd be interested to hear the perspective of someone who uses and likes this kind of thing on what's to like about it.

67868018 5 years ago |

Storygraph is the new hotness

https://beta.thestorygraph.com/

stakkur 5 years ago |

Amazon owns Goodreads. Goodreads is nothing but a sales funnel, and Amazon doesn't like public APIs to their data.

BigBalli 5 years ago |

Truly a bummer although Goodreads was never truly "happy" with third party. I created https://MyBookList.club and messed around with many book providers, GR was always last.. only strength is its large user base.

kopakabana 5 years ago |

Not having an API is short-sighted.

Those who will use your info legitimately will probably use an API, but those who only want your data can hide their IP across a number of cloud agents to extract all of the data from your site regardless of whether you offer an API or not unless you use CAPTCHA.

eznzt 5 years ago |

With time everybody learns if your time is worth something you will be scrapping, not using an API.

octoberfranklin 5 years ago | |

I dunno, maintaining a scraper across neverending site redesigns is a friggin' lot of work. Way more work than updating my code to use a new version of an API.

eznzt 5 years ago | | |

When was the last time goodreads was redesigned?

genidoi 5 years ago | | |

If all that’s changing on a site is the presentation/ordering of elements, then you just need to update some XPath selectors every now and then and have a notification system for knowing when those selectors aren’t working.

You can even automate this process by having a known input (eg, URL to a book on goodreads + its known [and hopefully unchanging] book title) and have a script that periodically checks that the xpath string matches the known pages text / generates a new one to point to the title. This is harder for values that do change but there are always workarounds

hiq 5 years ago | |

It depends on what you do with the content: if you're just using it for yourself and can afford it to break at random times, scrapping can be fine (although maybe still more time-consuming depending on the website).

But if you need predictability and reliability (e.g. you're providing a service to other people) for whatever you implement using this 3rd party service you don't control, relying on their ui that they can break any time they feel like it will lead to more downtime than APIs for which you're usually given some notice before they're deprecated.

lkrubner 5 years ago |

I think if the Justice department was more technically savvy, and in one of its strongly anti-trust eras, they would see this as Amazon extending its monopoly power, and the Justice department might stop this.

deep_merge 5 years ago |

Can anyone recommend a Goodreads alternative?

user5994461 5 years ago |

Let's say you have the whole database of books from Amazon/GoodReads, including title/authors/genres/publishingdate/userrating/sales.

You'd like to make a recommendation engine, the idea is that the user could input 1-3 books they liked and it would suggest more books that are similar.

What sort of algorithms should I look into to do that sort of things?

Note that I don't have user profiles with what book they read, only have the database of books, can't do the recommendation engine based on two users liked the same book so they will also like other books either of them liked.

meekrohprocess 5 years ago | |

IMO, you could highlight the differences between similar titles.

It's easy for recommendations to get stuck on a local maxima if they only look at one metric at a time, like "similarity" weights. But if you have a lot of metadata about each title, you can break out of those "loops" by sprinkling in metrics like ratings/genres/release date/popularity/etc. This doesn't have to hurt from a performance perspective, either; you can filter on the same single metric, but request more recommendations than you need and pluck out a pseudo-random set in the application logic.

That also lets you provide context for the recommendation. "It's like this, but [older/more obscure/with vampires]."

inoop 5 years ago | |

AWS has a service for that, it's called [personalize](https://aws.amazon.com/personalize/)

hombre_fatal 5 years ago | |

fwiw you’re asking how to build a reco engine without any of the necessary data needed to build one. Finding good data is the hard part. That’s the part that stops every HNer from building a reco engine as a hello world weekend project.

jbaber 5 years ago |

I'm really liking https://beta.thestorygraph.com/ especially for suggestions so far.

https://www.newstatesman.com/science-tech/social-media/2020/...

loosetypes 5 years ago |

What’s a good way to group isbns for various versions of the same title?

For example, searching Old Man and the Sea on isbndb returns (as you’d expect) many isbns:

https://isbndb.com/search/books/Old%2Bman%2Band%2Bthe%2Bsea

Do books have another identifier that logically consolidates editions, foreign language prints, etc.?

cdrini 5 years ago | |

Open Library has a notion of "Works" which group together editions across languages: https://openlibrary.org/works/OL63073W

(Note the work id is in the url: OL6307W)

If you want to get the work id for a given ISBN: https://openlibrary.org/isbn/2070360075.json will redirect to the edition page, and there you can get "works[0].key".

Or, you can search by the isbn: https://openlibrary.org/search.json?q=isbn:2070360075

loosetypes 5 years ago | | |

Thanks. So Open Library implemented that because there's no interoperable, universal concept for literary "works"?

And then each of Goodreads, Amazon Retail, Ingram, et al. would likely have their own internal system for isbn identification and grouping accordingly?

And I guess that means similarly, there's no isbn-analog for a series or interrelated set of "works"?

gjreda 5 years ago |

Recently wrote some code to scrape a friend's reviews and ratings from Goodreads. Maybe it'll be useful to folks here: https://gregreda.com/2020/11/17/scraping-pages-behind-login-...

strifey 5 years ago |

I was super disappointed in this but not surprised, honestly. Started building features for a Discord bot to share book reading updates, etc. with friends, and the API was rough to say the least (especially their OAuth implementation). You could tell they hadn't dedicated any resources to it in a long time.

nfriedly 5 years ago |

That's a real disappointment. I've gotten back into reading in the past few years, I've been tracking everything in goodreads, and I was just thinking about building something to display what I've been reading recently on my website that would pull from the goodreads API. There goes that idea...

kylebenzle 5 years ago |

I've been building a GoodReads scraper for a long time that (I think) can do just about anything the API can.

https://github.com/KyleBenzle/Good-Reads-Scraper

doublejay1999 5 years ago |

> open their wallets, pay for the services they use, and support independent businesses

Except these things began as hobbies before business became involved at all.

It’s a message board for book lovers ffs, pre web people used to build things themselves, at their own expense, just for fun.

hombre_fatal 5 years ago | |

Yet all of my favorite early web places have died out unless they could pivot into a business. I don’t want someone’s precarious, eternal charity as the only thing keeping something alive. It’s in my best interest that they make a living from it if I care about the service at all.

mikejethi 5 years ago |

The web has to mature beyond advertising as a business model. For this to happen people are going to have to open their wallets, pay for the services they use, and support independent businesses.

I hope it does! I'm banking on it too for my project.

miguelrochefort 5 years ago |

I recently built an audiobook scrobbler for Android that automatically updates the status of books I read on GoodReads.

I'm very disappointed to see that they're killing their API. I'll be looking for an alternative.

billfruit 5 years ago |

Goodreads really needs a solid competitor, the site seems to be stuck in a timewarp:

It is load times are one of the longest of the sites I commonly visit, and it barely had any new functionality added in the last 10 years.

hombre_fatal 5 years ago | |

They exist, have you looked? Though none of them can compete with the Kindle’s dedicated Goodreads button.

loosetypes 5 years ago |

What’s a good data source for book metadata and covers these days?

thrower123 5 years ago |

Does anybody know what these Goodreads APIs that are being deprecated actually are?

I'm struggling to imagine what they could be; Goodreads doesnt exactly have much data that's really useful.

sradman 5 years ago | |

https://www.goodreads.com/api

justhw 5 years ago |

Welp..If you just need the make a book list features, i made an alternative a while back.

https://bookshulf.com/

leethargo 5 years ago |

Is https://www.whatshouldireadnext.com/ a viable alternative?

jiggawatts 5 years ago |

I'm curious: What were people using this API for exactly?

towelpluswater 5 years ago |

If someone can manage to find a creative way to tap into the Kindle ecosystem and provide a similar service, I'd imagine they could do quite well.

chrisweekly 5 years ago |

Equal parts sad and unsurprising.

benibela 5 years ago |

Guess, I am not going to add goodreads support to my reading-tracking library app

vuciv1 5 years ago |

damn, this is really disappointing. I use Goodreads, Letterboxd, and GG pretty frequently and wanted to use the three APIs to just have all these sites' functionalities in one place.

Guess it won't be happening now...

newbie578 5 years ago |

This is such an interesting topic to discuss. The book industry by itself is somewhat in a state of limbo.

Goodreads is basically the only major social network for bookworms, yet the majority of its users hate it (including me), but are forced to use it to their chagrin.

You would think that would make the market ripe for a disruptor to arrive and topple the incumbent leader, yet each year nothing happens.

I personally have also thought of making a new rival product, but when you do the math on the market potential and the financial benefits, I just don't see a viable way.

People who read books, even if they read them every day, won't use your social network each day since books by themselves are the type of content which is consumed the longest (compared to a movie, tv show, song, or video game).

So you have a social network where users come back on a whim, even if you read like a maniac and try to read one book per week (I tried it one year, it was crazy, you are basically spending all your free time reading), even then users wouldn't use your app each day, but perhaps once or twice a week, and who knows how much time would the average session last?

To make things worse, you could maybe even get away with users using your app once a week if you have a big enough market (user base), but the number of book readers is not that great (especially compared to other media consumption)... The median American reads 4 books a year [1], or simply put, one book in three months.

So you have a social network where users don't need to use it often and there aren't a lot of users, that already spells trouble, but there is another major issue.

You could even succeed with those issues if you had a highly commoditized product to advertise, let's say a social network for yacht lovers, even if you have a small number of users and they do not use it much, you can still manage to succeed with it, since if you advertise yachts to potential yacht owners, you have a very valuable marketing channel which is worth quite an amount to the right people.

You can see where I am going with this, just compare a 5% commission on a yacht, vs a 5% commission on a book... Unfortunately books are not so highly valued (in monetary terms) nor sought after.

To sum up, you have a social network where users don't spend a lot of time, you don't have a lot of users and it is centered around a low profit product... Of course Goodreads has no competition, no sane person would touch that market with a ten foot pole.

Yet, to quote George Bernard Shaw, "all progress depends on the unreasonable man". If someone manages to solve this problem and find a profitable way to survive, I would not be surprised to see Goodreads fall.

I even thought of contacting Scribd to work with them, since I think they might have the best shot currently to position themselves as market leaders. They have an excellent product (Netflix for books) and already have a well sized user base. Would be interesting to see them expand and also became a social network for book lovers.

[1] - https://www.bustle.com/p/how-many-books-did-the-average-amer...

[2] - https://www.goodreads.com/quotes/536961-the-reasonable-man-a...

banach 5 years ago |

Time to support another reading community! Anyone got any suggestions?

soft_dev_person 5 years ago | |

I have been considering making one. Goodreads is so awful, and the alternatives I have found are all quite quirky or broken in one way or another.

But having a database of "all books" is not necessarily trivial. Even though OpenLibrary really does provide a great start, the contents seem to come from Goodreads/Amazon in a lot of cases, and I'm concerned about the legality of making a commercial competitor based on it.

Also, it would take a lot of time and data to get a good recommendation engine going. Amazon really is in the best position to do this. Just a shame that Goodreads get so little love from them.

chucktorres 5 years ago | | |

If you embark on such a project, I would help. Drawing inspiration from https://www.themoviedb.org/

BlueTemplar 5 years ago | | |

Wouldn't Wikipedia have a database like that?

waylandsmithers 5 years ago | |

I've been working on a related product (never going to be a complete set of books, but you can read them through the site because they are public domain) that you can check out in my profile if interested

gundmc 5 years ago |

So much for the narrative that Amazon never kills any services.

postit 5 years ago |

Your content is the only currency you have on the internet.

OOPMan 5 years ago |

I used the Goodreads API a few years back. It was awful.

qwerty456127 5 years ago |

Very sad. Good bye Goodreads.

mro_name 5 years ago |

there seems to be no such thing as a reliable 3rd party.

Jay7722 5 years ago |

keep up !