Show HN: Applying the Unix philosophy to neural networks

Show HN: Applying the Unix philosophy to neural networks(github.com)

251 points by cloudkj 7 years ago | 64 comments

HALtheWise 7 years ago |

I don't see any claims about performance, but I would be very surprised if it was anything better than abysmal. In a modern neural network pipeline, just sending data to the CPU memory is treated as a ridiculously expensive operation, let alone serializing to a delimited text string.

Come to think of it, this is also a problem with the Unix philosophy in general, in that it requires trading off performance (user productivity) for flexibility (developer productivity), and that trade-off isn't always worth it. I would love to see a low overhead version of this that can keep data as packed arrays on a GPU during intermediate steps, but I'm not sure it's possible with Unix interfaces available today.

Maybe there's a use case with very small networks and CPU evaluation, but so much of the power of modern neural networks comes from scale and performance that I'm skeptical it is very large.

enriquto 7 years ago | |

> I don't see any claims about performance, but I would be very surprised if it was anything better than abysmal. In a modern neural network pipeline, just sending data to the CPU memory

Notice that the bulk of data does not necessarily go through the pipeline (and thus by the cpu). You may only send a "token", than the program downstream uses to connect to and deal with the actual data that never left the gpu.

ken 7 years ago | | |

This is similar to the Tcl approach. “Everything is a string”, but the string may be something like “the network socket at address 0x12345678”.

jasonhansel 7 years ago | | |

Would be nice if Unixes provided better support for pipelines of this sort...

bollu 7 years ago | | |

Pretty sure you can't do this since the GPU's memory is per-process isolated as well.

magpi3 7 years ago | |

> Come to think of it, this is also a problem with the Unix philosophy in general, in that it requires trading off performance (user productivity) for flexibility (developer productivity)

“Sure, Unix is a user-friendly operating system. It's just picky with whom it chooses to be friends.” ~ Ken Thompson on Unix

But seriously, I would argue that Unix is "superuser" friendly - very friendly to advanced users who like their power tools, and is only unfriendly to those who want to have a more casual relationship with their computer (which admittedly is probably 98% of users).

I am not really a developer anymore, but any system that expects me to use a mouse over a keyboard makes me feel less productive.

AnIdiotOnTheNet 7 years ago | | |

It's not even friendly to superusers, it's a horrifically complicated mess of cobbled together archaicness.

dllthomas 7 years ago | |

> trading off performance (user productivity) for flexibility (developer productivity)

Flexibility can often be "user productivity" as well.

jmmcd 7 years ago | |

This is only for inference, which is very cheap already, cheap enough for most applications (real-time video processing an exception). Training is the slow part which is worth putting on a GPU.

bigred100 7 years ago | |

I mean, some people I know achieve developer flexibility and performance by just writing their own implementation in FORTRAN. Unfortunately this is inadvisable for many people and seen as undesirable by even more people

mark_l_watson 7 years ago | | |

note that this uses BLAS that is largely implemented in FORTRAN

nsilvestri 7 years ago | |

> trading off performance (user productivity) for flexibility (developer productivity)

Is this not something that can be helped with more advanced compilers?

peter_d_sherman 7 years ago |

Excerpt: "layer is a program for doing neural network inference the Unix way. Many modern neural network operations can be represented as sequential, unidirectional streams of data processed by pipelines of filters. The computations at each layer in these neural networks are equivalent to an invocation of the layer program, and multiple invocations can be chained together to represent the entirety of such networks."

Another poster commented that performance might not be that great, but I don't care about performance, I care about the essence of the idea, and the essence of this idea is brilliant, absolutely brilliant!

Now, that being said, there is one minor question I have, and that is, how would backpropagation apply to this apparently one-way model?

But, that also being said... I'm sure there's a way to do it... maybe there should be a higher-level command which can run each layer in turn, and then backpropagate to the previous layer, if/when there is a need to do so...

But, all in all, a brilliant, brilliant idea!!!

kwaugh 7 years ago | |

> Now, that being said, there is one minor question I have, and that is, how would backpropagation apply to this apparently one-way model?

The author mentioned that this is only for inference of neural networks (not training), so this does not support backpropagation.

PeterisP 7 years ago | | |

This kind of misses the point of the Unix philosophy of being able to dynamically reconfigure things - realistically, to get decent results, you'll need to do inference with the exact same connections as you trained (or at least finetuned) them, so there's no good reason to split the model in smaller parts.

mempko 7 years ago |

What's wonderful about this concept (and unix concept in general) is that the flexibility it gives you is amazing. You can for example pipe it over the network and distribute the inference across machines. You can tee the output and save each layers output to a file. The possibilities are endless here.

skvj 7 years ago |

Great concept. Would like to see more of this idea applied to neural network processing and configuration in general (which in my experience can sometimes be a tedious, hard-coded affair).

craftinator 7 years ago |

I've been thinking about something like this for a long time, but could never quite wrap my head around a good way to do it (especially since I kept getting stuck on making it full featured, i.e. more than inference), so thank you for putting it together! I love the concept, and I'll be playing with this all day!

xrd 7 years ago |

This might not be a great way to build neural networks (as other commenters have said regarding performance). But, it could be a great way to learn about neural networks. I always find the command line a great way to understand a pipeline of information.

luminati 7 years ago |

Great idea but however equally great caveat - it's just for (forward) inference. Unix pipelines are fundamentally one way and this approach won't work for back propagation.

bigred100 7 years ago | |

I don’t see any reason you couldn’t just spit out the output and the derivative of the layer output with respect to the weights, then multiply and carry these all the way down. Then if you have a loss function at the end you have the gradient. Probably this project is for fun and not scale so it’s fine. But then you need to think about changing the weights on every layer based on the optimization

Rerarom 7 years ago |

Sounds like the kind of thing John Carmack would enjoy hacking on.

fouc 7 years ago | |

How's that? Isn't he more of a C/game developer? Is he a unix guy?

snazz 7 years ago | | |

https://news.ycombinator.com/item?id=9810342

John Carmack working on Scheme as a VR scripting language

Donald 7 years ago |

See also Trevor Darrell's group's work on neural module networks:

https://bair.berkeley.edu/blog/2017/06/20/learning-to-reason...

mark_l_watson 7 years ago |

Wonderful idea and the Chicken Scheme implementation looks nice also.

I wrote some Racket Scheme code that reads Keras trained models and does inferencing but this is much better: I used Racket’s native array/linear algebra support but this implementation uses BLAS which should be a lot faster.

dekhn 7 years ago |

https://www.jwz.org/blog/2019/01/we-are-now-closer-to-the-y2...

bitwize 7 years ago | |

Linking to jwz directly from hackernews will show a nutsack in your browser and not anything useful.

dTal 7 years ago | | |

Unless you use an extension that hides your referer. Which you should - it's a needless privacy leak, and permits people to play stupid games like this. I use Smart Referer.

https://addons.mozilla.org/en-US/firefox/addon/smart-referer...

toxik 7 years ago | | |

What an absolutely childish thing to do.

dekhn 7 years ago | | |

yup I know :)

cr0sh 7 years ago |

Well - I will say I like the general concept. I just wish it wasn't implemented in Scheme (only because I am not familiar with the language; looking at the source, though - I'm not sure I want to go there - it looks like a mashup of Pascal, Lisp, and RPN).

It seems like today - and maybe I am wrong - but that data science and deep learning in general has pretty much "blessed" Python and C++ as the languages for such tasks. Had this been implemented in either, it might receive a wider audience.

But maybe the concept itself is more important than the implementation? I can see that as possibly being the case...

Great job in creating it; the end-tool by itself looks fun and promising!

andbberger 7 years ago |

Reading the title, I can't help but think of the 'Unix haters handbook' and groan, why would you want to apply the unix philosophy to nets??

F-0X 7 years ago | |

>I can't help but think of the 'Unix haters handbook' and groan

The one true gripe they outlined is that the user interface requires some explanation before one can bootstrap their own knowledge.

I'll defend the unix philosophy till I die, probably. Why _wouldn't_ you apply it here?

bitwize 7 years ago | | |

Because all the time spent marshalling and unmarshalling data structures into bags of textual bytes takes up CPU and does nothing but hasten the heat death of the universe. We have better models for that sort of thing. See: PowerShell.

ganzuul 7 years ago | |

Multithreading for one.

jawilson2 7 years ago | |

> why would you want to apply the unix philosophy to nets

I wouldn't and won't

nihil75 7 years ago |

I love you