Show HN: Octo – Generate a serverless API from an SQL query

Show HN: Octo – Generate a serverless API from an SQL query(octoproject.github.io)

241 points by khalidlafi 5 years ago | 58 comments

eatonphil 5 years ago |

I've got a similar project that reads your db schema and generates a Go REST API and a TypeScript/React web interface. (The code-generation is language agnostic so at some point I'd like to add at least a Java REST API as well.) It supports PostgreSQL, MySQL, and SQLite.

Unlike PostgREST/Hasura and some other dynamic tools you can "eject" at this point if you'd like and continue on development without the generator in a language you already know. But I'm working on exposing Lua-based hooks you could carry across whatever backend language you choose to generate and avoid the need to eject.

It has builtin support for paginated bulk GET requests with filtering, sorting, limiting. Built-in support for bcrypt-password authentication and optional SQL filters specified in configuration for authorization of particular endpoints based on session and request metadata.

Still very much a work in progress but the goal is to push the envelope on application boilerplate.

Screenshots are of the example/notes project in the repo.

https://www.dbcore.org/

https://github.com/eatonphil/dbcore

MuffinFlavored 5 years ago | |

I feel like projects like this work for simple stuff but as soon as you need analytics/insights or actual business logic, you almost always need to just "roll your own" API. Am I wrong? Do other people feel this way? Can anybody think of a few projects they've worked on that would be too complex/a ton of work to make work with these kind of simple template generators?

eatonphil 5 years ago | | |

Long-term maintainability is definitely my concern. I don't see projects like this so much as products themselves (maybe I'm myopic) but as core infrastructure. I don't want to ever write the boilerplate again, but I should be able to extend it maintainably over time (hence ejecting or Lua hooks).

My goal in building this is to allow myself to more rapidly prototype real, complex applications. It's not there yet but I've got such an application in mind, building toward support for it as I'm developing this.

robmccoll 5 years ago | | |

It seems like when working with generators, the trick is to have the right boundaries between generated code, points where you can extend the generated code, and the API through which you use the generated code. If successful, you should never feel the need to hand edit the generated code itself, and you shouldn't need to worry too much about re-running the generator breaking things or stomping on your code.

BlackCherry 5 years ago | | |

I think on average you're right. As the project grows, these generators lose some of their initial value and speed.

If I were to use something like this, it'd be for rapid development initially for prototyping purposes. Then I'd transition to something more bespoke as needed.

If you know all your requirements up front, then starting bespoke from the beginning may be the better route.

CuriouslyC 5 years ago | | |

With postgrest you can add stored procedures as rest rpc calls, and you can always roll a microservice for more advanced stuff. In practice these sorts of auto-api tools make a good starting point as long as they support your authentication and authorization needs.

lukeramsden 5 years ago | | |

Postgraphile allows you to write plugins to arbitrarily extend (GraphQL) schema and wrap resolvers, which is useful for things like interfacing with external APIs etc. It works just fine for me at my current project.

jon-wood 5 years ago | | |

The sweet spot for things like this is for them to generate the sort of code you'd want anyway, in a way which allows you to selectively replace bits where you need additional business logic or UI customisation.

Rails' generators are a pretty good starting point here, if a little bit more verbose than I'd like. They're great for getting the boilerplatey bits off the ground, and focusing on the bits that are unique to what you're doing.

manigandham 5 years ago | | |

Used PostgRest once. We had a project with datasets for sale where all the data was stored in a database and didn't need any updates or transactions. Postgrest was easy to create a self-contained API.

https://github.com/PostgREST/postgrest

chadhutchins10 5 years ago | | |

DreamFactory is basically a paid service for this sort of thing. They support something like 20 types of databases (among many other data sources). They have a lot of features that make the exposed api be good enough long-term. https://www.dreamfactory.com

jjeaff 5 years ago | | |

Ya, I have very few tables in my db that can be updated directly without needing to pass the data through some business logic, trigger notifications, etc

FinalDestiny 5 years ago | | |

I’m using Prisma right now and they allow you to manually expose your fields, and they allow you to “resolve” any field as you see fit. If you want to run the original function, you can provide “originalResolve” and call it later. I think Prisma has a great (albeit in progress) way of doing what you’re saying.

It also integrates with graphql-codegen so you can generate code for apollo/others

henryfjordan 5 years ago | |

This is not dis-similar to what Strapi.io does, although I don't think they realize that's a big selling point from their marketing materials.

With strapi you configure your DB and get code generated in JS that supports a standard CRUD REST API. If you want to add business logic, you can override any particular endpoint you want. Their docs even come with the default implementation for easy copy/paste.

I would love to see research in this space continue, I think it's the future of bringing non-technical people into the product development process (if you can understand building a workflow with Excel/Google Sheets/Airtable, you can understand building an API). I'm excited to check out your project.

RileyJames 5 years ago | | |

My thoughts exactly as I’ve been exploring strapi for a recent project.

The UI allows for build out the schema, and has all the CRUD interfaces pre-built and the CRUD api endpoints.

But then the code is still all there and you can add additional controllers actions, model hooks and services.

I haven’t run into any hard limitations yet, been very impressed.

jdc 5 years ago | |

In case anyone else is wondering what's doing the templating in this project, it's Scriban.

https://github.com/lunet-io/scriban

eatonphil 5 years ago | | |

A port of Ruby's liquid templates to .NET, yep.

Trisell 5 years ago |

The idea is interesting. But it looks like you end up with a yaml file that enumerates each of your tables/endpoints and the queries that back them. So are we exchanging the “complexities” of code, where we have control and testing, for the “lack of complexity” of yaml that becomes unwieldy and untestable in the name of “simplicity?”

stingraycharles 5 years ago | |

Don’t forget that at some point, you’ll want to generate the yaml from code, because otherwise it becomes impossible to maintain. And quickly you’ll find yourself back at square 1. :)

Glyptodon 5 years ago |

One of the things that's not obvious to me about things like this (and other similar tools) is where/how scopes/limitations/permissions are handled. I assume they either are or can be, I just never see it spelled out clearly. What am I missing?

akie 5 years ago |

Perhaps I'm old, but who needs an API for an SQL query? I'm not sure I understand the use case, or the advantage of something like this over a regular API call to a backend which would also allow you to do e.g. authentication. Enlighten me?

cube2222 5 years ago |

Looks great!

If you like this, check out OctoSQL[0]... Also in Go... Though OctoSQL lets you query multiple databases / files / event streams like kafka using SQL from your command line, not as a server, so a fairly different use case, but you should check it out nevertheless!

The naming clash is funny.

[0]: https://github.com/cube2222/octosql

jhoechtl 5 years ago | |

I realy like your tool. In fact I am slowly integrating it into a solution which will expose a REST API and workspaces identified by a UUID. In our organisation it is so common to receive an Excel or csv which you have to join with the database. Octosql is great for that.

I am wondering what future role badger will play in the future? It would also make a great additional KV backend btw.

cube2222 5 years ago | | |

That's really great to hear!

We're considering moving to a more in-memory model, as we're not sure if the badger storage idea was a good one and worth it.

TBH we're still not quite sure in what direction we'll be continuing. Though we're surely gonna be developing it further.

But currently we're considering a rewrite with multiple assumptions changed (column oriented).

ForHackernews 5 years ago |

Looks vaguely similar to http://postgrest.org/

lukeramsden 5 years ago | |

And Hasura, Postgraphile et al too. These, as well as PostgREST also give you much more flexibility in the form of plugins in library mode and other such things - they also generate the actual queries for you, via introspection, as opposed to this which requires you to write the query yourself.

I think there's certainly space for this project, i.e. hand-written queries, on any database (Postg[REST|raphile] both only work with Postgres of course, not sure about Hasura). Not sure it will succeed without support for more forms of Serverless deployment, primarily Lambda.

alexellisuk 5 years ago |

Nice to see openfaas featured here and thanks for your PRs to Arkade. I do wonder what your strategy is on connection pooling and authentication?

Also not keen on the passwords being kept in a plaintext file - someone will check that into git. OpenFaaS has secret support which you can use Amal. So does Knative.

mitjam 5 years ago |

Reminds me of the venerable Datasette by Simon Willison: https://github.com/simonw/datasette

o1lab 5 years ago |

Interesting concept and quite liked the playful logo. Can we pass in env variables to db connection ?

We are in similar space, we take input params of db and generate CRUD apis with Auth+ACL and then APIs are packed into a single lambda function. There is support for serverless framework as well.

[1]: https://github.com/xgenecloud/xgenecloud

amal_kh5 5 years ago | |

Yes, for example:-

? Enter the database password: ${DB_PASSWORD}

alexzender 5 years ago |

Interesting, I've built a similar project that generates GraphQL API based on your database schema - https://okdb.io

bobbywilson0 5 years ago |

My main purpose of tools like these has always been prototypes, or hobby one-off type stuff. For SPAs, or a sketch with a Jupyter notebook. They're great for this sort of thing because in my experience, this used to require building some sort of API just to get a simple json interface to the database. It was my understand that the purpose of these types of tools was mostly that.

Are folks using these kind of things for non-trivial production applications?

hn_throwaway_99 5 years ago |

I fear that all of these "expose your DB as an API" tools like this, Postgraphile, Hasura, etc. are going to set up folks for a world of hurt down the road. Tightly coupling your end clients to your database schema can make it extremely difficult, if not impossible, to refactor your DB in you need to (which is highly likely).

brycelarkin 5 years ago | |

I’m building a project using one of those tools. I imagine that difficulty refactoring your database is more a problem of bad schema design than the tool. If you normalize and abstract out the implementation details into Views, I can’t see how refactoring would be difficult. Haven’t built anything at scale with Postgraphile/Harusa, so just wondering if I’m missing anything here.

TylerE 5 years ago | |

Views make it trivial to decouple what a query returns from the underlying schema

modarts 5 years ago |

Looks like someone took this tweet literally https://twitter.com/davecheney/status/1296033304756404225

rimkms 5 years ago |

Logo is looking good , gj

revskill 5 years ago |

Do u know any similar tool wwhich supports group by query ?

amani92 5 years ago |

Very impressive, Great job.

WrtCdEvrydy 5 years ago |

Your timing is perfect.

ahmadbana 5 years ago |

Very interesting projects and can be scalable Keep up the good work