Training an AI to convert design mockups into HTML and CSS

Training an AI to convert design mockups into HTML and CSS(medium.freecodecamp.org)

232 points by AshishGupta93 8 years ago | 102 comments

_xnmw 8 years ago |

The example mockup image shown is really unimpressive. As usual, the difficulty is in implementing the last 20% of edge cases and design subtleties, not well-defined black-and-white block layouts like the given example. It would've been simpler to build a classical program to handle that example, and the result would be much better and more predictably structured.

It's like programming a self-driving car for a 2D game. Trivial. Now try taking that to tar roads. I don't know why this toy example is even news, it's nowhere close to the difficulty of real-world design mockups.

The AirBnB sketch2code example is a lot more impressive, but that's basically just handwriting recognition with a 1 to 1 mapping of symbols to code pieces.

candu 8 years ago | |

Do we really have to crap all over this person's work? It's clearly intended as a learning exercise / tutorial written by someone who's in the process of learning themselves, and not as "breaking news" or a production-ready system.

IMHO we should be encouraging this sort of engaged learning.

Rexxar 8 years ago | | |

If he just try to learn something it is unappropriate to start the article with this sentence : "Within three years, deep learning will change front-end development. It will increase prototyping speed and lower the barrier for building software."

headcanon 8 years ago | |

A usable service taking advantage of this is still a long way off, but we don't get there unless people do the legwork.

One thing that could be usable more quickly would involve simplifying the problem space a bit. For example, the airbnb example using symbols could translate to, say, react components. If the grid system is well designed and you have a mature library of components, one could easily come up with interfaces that would generate react code, and ideally work well enough to serve as a starting point for the engineer to take over and turn it into a working app.

A system like this also need not be used in production-ready apps - I can envision a scenario where PM and designer can quickly sketch out and test ideas with working interfaces, without direct engineer involvement. That in and of itself would change front-end development significantly as the article claims.

haggy 8 years ago | |

Found the front end developer :D

But all jokes aside, I have to agree with others replying to your comment that this looks to be a learning exercise. It shouldn't be berated or beaten with a club. Without this kind of curiosity, the tech industry wouldn't be where it's at today.

joe_the_user 8 years ago | | |

From the article:

"Within three years, deep learning will change front-end development. It will increase prototyping speed and lower the barrier for building software."

-- This is an article that somewhat breathlessly claims that AI will be the thing plugging the "last mile" problem in the coding of the front end.

I think myself and other other folks chiming in here would be less skeptical if this "last mile" problem hadn't existed since the 90s. And moreover, it seems like a conventional solution to this should work.

The problem is that last-mile, best front-code varies not based on the input image but on a multitude of contexts outside of the image itself - the server software, how the code will be used, etc.

I'd say this is indeed a problem for AI but it would require a distinct paradigm than the present train/test/output paradigm, more like a expert system that could modify it's behavior with natural language output.

jeffmcmahan 8 years ago | |

Yes, this is typical AI weenie optimism. "We can handle many of the easiest cases, so in X years we will be able to do all the really hard things that we haven't even thought of yet!" No.

beached_whale 8 years ago | | |

You could have made you point in a far better way that encourages and doesn’t berate. Eg often people are optimistic but are unaware of the hard edge cases, the edge cases are interesting

maffyoo 8 years ago | | |

i dunno, we could equally say that your position is UI optimism. AI is new tech, certainly as it now becomes feasible to apply it to real problems. Given some of the challenges AI is solving right now this one is relatively trivial, HTML and CSS have specifications and there are well held paradigms and principals of User Interface design. The domain is very constrained so lends itself to AI very well. To say this is an exercise undertaken by the author and the very visible progress he has made is very impressive. If you read the entire article the work he does on recreating designs is even more so. Surely one of the goals of 'intelligence' artificial or otherwise, is to actually deal with "hard things that we haven't thought of yet" AI is solving (and unearthing) those problems right now. Imagine somewhere like squarespace where all you do is upload a design and website is produced as a result is that beyond the realms of possibility given this really impressive experiment produced by the author, definitely not. Id hate to think we had to contrive edge cases or fabricate complexity just so we can prove that AI cant sole every problem...

joe_the_user 8 years ago |

Aren't there a bunch of WYSIWYG tools for creating appearance and HTML/CSS together? Years ago when I looked, these existed (but designers prefered photoshop just because).

I mean, I'd assume also there's normally a give-and-take between designer, CSS-artist and client. The question is whether the neural network can also learn to take calls at 3am from a client wanting a different shade of aqua.

richjdsmith 8 years ago | |

I'm pretty sure Adobe is still making Dreamweaver.

I know that's how I built my first website way back in grade school.

notahacker 8 years ago | |

There are also plenty of existing tools for turning Photoshop mockups into HTML/CSS using hard coded assumptions about HTML structure and appropriate ways to slice images, with tradeoffs involving clunkier code, lack of intelligent support for other devices and window sizes and inability to change the shade of aqua at 3am unless you have a basic idea of how the output is structured.

From what I can see the Deep Learning process tried here needs a lot more technical knowledge to create a suitable training set for the type of output you want, a lot of iterations to converge on anything vaguely suitable, and even when it gets to what's considered an end product you're likely going to have to dive into the output code to correct button colours and text before we even start to think about its behaviour at different window sizes. As an experiment it's very interesting, as a replacement for the designer it's probably behind non-AI approaches to turning pics to code.

falcolas 8 years ago | |

> take calls at 3am from a client wanting a different shade of aqua.

Just have the AI include a duck with every creation.

https://blog.codinghorror.com/new-programming-jargon/ #5

pjmlp 8 years ago | |

Who takes calls at 3am when not on support shift?!?

TheAceOfHearts 8 years ago |

Writing HTML and CSS to implement a mockup is trivial for anyone with even a little bit of web dev experience. If you want high quality code, an AI isn't going to produce that.

I can't imagine ever using something so complex, if I can implement the same thing in a few minutes.

There's also further considerations when dealing with the real world. For example, you need to be aware of how to handle different accessibility features. Designers rarely seem to care about things like accessibility, but it needs to be declared somewhere.

shironineja 8 years ago |

This is what we are all working for. If we can automate the stupid stuff we do then we can work on other interesting stuff and automate it too.

I don't understand why we aren't all working on automating everything that we do.

Every line of code you pulled out of the code mine today is finite and given a business logic rule engine it's outcome could have been generated.

m3kw9 8 years ago |

What’s the point of the code isn’t human readable? You need to train the AI to write code that is easily modifiable with low dependencies. Otherwise the conversion isn’t very useful for anything unless your page is completely static

sebringj 8 years ago | |

Right but what if we didn't have to code anymore? Who would be reading it? Meaning, we don't worry about compiler readability anymore, we used to when assembly was fine tweaked etc. Just a thought. I work on full stack including front end animations etc but frankly its tiring and time consuming work to do that. Imagine if code was just you talking to it or showing it some whiteboards, then you would just talk to it again or show it other whiteboards. Code doesn't matter in that case.

sebringj 8 years ago | | |

Sorry replying to my own thread... I was just thinking, it all comes down to your will. Who will have the greatest will and orchestration of all these AIs combining them together. That will be the next thing, well its already happening.

vtange 8 years ago | | |

It's unlikely we will leapfrog from human-written code to relying on black-box generated code in one step. Everyone's been pointing to AI being used to augment existing jobs, as opposed to outright replacement.

In order to augment a front-end developer well, we'll need human-readable code, unless we like reading uglified code/make an AI for that too.

karmakaze 8 years ago |

Honest question: is anyone else sensing some defensiveness here? The timeframe is open to debate but what's more controversial is whether html/css is near the frontier of solvable problems. After AlphaGo, my worldview is changed forever. Glad to be still being valued for something I started out of joy but in no way feel it's not disruptable. Sometimes I even wonder why it hasn't advanced further since it's quite well defined, digital data in/out.

memebox3v 8 years ago | |

Yup, lots of ppl are very defensive about this! We are the farm workers looking at a prototype of a combine harvester. We will stop ourselves from believing it until its taken our job and then we will claim it was obvious all along. Ppl are emotional.

chillacy 8 years ago | |

My most uncharitable interpretation is that it's easy to cheer for AI when it replaces someone else's job, it's harder when it might replace your job.

nightski 8 years ago |

This is really cool and I believe necessary work. We need to start thinking about how to automate programming and to free ourselves much further from the mundane. Trust me, there is still plenty of room for people to analyze requirements and come up with a high level solution to people's problems that doesn't require fiddling with float: right and various technical hacks.

That said I wonder if this is the right approach. At some point "AI" as used in this context is just a function mapping from an input domain to an output domain. The output domain in this case, "code" was designed for human readability (whether it succeeded is a whole different approach).

What would a programming language designed for output from an AI system look like? How could we optimize it to reduce the output domain size of the function the AI has to train to learn? How could we optimize it to make the problem more tractable for machines? I feel like there is an entire field of research here. Maybe it has already been studied and I am just late to the game.

book_mentioned 8 years ago |

Turning web design mockups into code with Deep Learning | https://news.ycombinator.com/item?id=16115353 (Jan 2018)

>chrisfosterelli: This is a neural network that takes an image and predicts very simple blocks (like BODY, TEXT, BTN-GREEN in the bootstrap example) and then uses a map to convert them to well-formed HTML

>jamesjyu: I've always wanted to do a contest with other frontend coders to see who could get closest to a complex layout—like the NYTimes—in one go. >>janneklouman: these types of contests exist! I went to one of these[1] maybe two years ago in Stockholm and I had a blast [1] http://codeinthedark.com/

Pix2code: Generating Code from a GUI Screenshot | https://news.ycombinator.com/item?id=14416530 (May 2017)

ghostbrainalpha 8 years ago | |

That second link doesn't have anything to do with A.I. does it?

book_mentioned 8 years ago | | |

>a contest with other frontend coders to see who could get closest to a complex layout

I don't know of any record of past entries nor if any participants used A.I.; it doesn't seem likely.

EGreg 8 years ago |

Why are design mockups done in photoshop, when HTML and CSS can be faster and easier once you have the basic components??

kowdermeister 8 years ago | |

1) Because that's how most designers create layouts who can't code an not really interested in doing so.

2) It demonstrates AI capabilities on a new level.

3) If you design in PS or Sketch, then you will create more interesting layouts, because you won't skip things that are inconvenient or hard to do.

andanotherthing 8 years ago | | |

Does anyone use SketchApp to design at all? Just curious, I thought Photoshop was being replaced..

meatbags 8 years ago | |

I design all my sites in Photoshop before turning them into HTML/CSS. It's a lot faster than fiddling with CSS, especially if I've been given a lot of freedom in the brief.

Completely designing websites in PS before even opening the code-editor has really improved my workflow. I know exactly how it has to look because I have a visual reference that's been OK'd by the client.

sametmax 8 years ago | |

Because the best artist i know don't know css and html. They are not coders but drawers.

maffyoo 8 years ago | |

you forgot to mention emacs..!

efitz 8 years ago |

OK this is juvenile but I found it funny that after 250 epochs (https://emilwallner.github.io/html/250_epochs/) the AI just came up with busted HTML and: penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas penas orna.

wonder_bread 8 years ago |

Is this a dupe of FloydHub's blog post from a couple weeks back? Markup looks very similar

sly010 8 years ago |

I agree with the need, my issue is with reproduceability and generally things that are not defined by the sketch (behavior, different screen sizes, etc). If everything were fully defined, we could just write a compiler from the high level definition to the low level. The real problem is that static 2D images are not good representations of dynamically sized interactive documents.

adamsea 8 years ago |

Kind of related to this, what do people think frontend developers will be doing in five years, or, where will they migrate to as frontend (either via tools like this or like AirBnBs SketchToCode) becomes more automated?

maffyoo 8 years ago |

what about the converse of this? what about producing a basic set of content and letting AI taxonomise or organise it and then create a design based on design models. It could work both ways. I guess ultimately the problem will be solved both ways and AI could create entire experiences based on specific domains. Also, for those claiming this will never work i would suggest looking at some of the amazing stuff being done with AI right now. This kind of problem, including optimising and making best usage of html/css is almost tailor made for AI.

talmand 8 years ago |

Interesting outcome.

The final html code from the first example is not that bad. There's the usual problem of beginner coders of too much div wrapping that probably isn't really necessary. I'm curious if the system can also create the css. The css also suggests a beginner coder, such as an overuse of unnecessary clear classes because of the overuse of floats for layout. Or having extra class names for things that are easily handled by a parent class reference, such as a "last" class on the last li in the list. Although, it appears the css is just a template obtained online. More on that later.

The second example using bootstrap is a soft failure in my eyes. Although the html does render correctly in the browser, which is because browsers do their best to render crappy html, the code is rough. The main problem is it decided to render the head element as a header element. Compared to the first example I'm shocked that this is the generated output. The usage of bootstrap does pose an interesting thought in that the content section of the html is more precise than the first example.

My reaction to this is it's a decent try at generating a website based on very strict rules assuming that more than half of the website creation process still requires a human to complete. For example, I could see this working quite well if one were to design your mockups strictly be bootstrap, or a template, and provide that css beforehand. If the mockup is custom outside of the template/bootstrap css then it'll have to generate that css itself. Which I think I'm more curious if that's possible. Generating html is easy as you can establish ground rules of "use this series of nested elements for this situation" and so on. The examples provided could just as easily be created by a drag-and-drop system that allows a non-coder to build a basic website. For that matter, use a markdown to bootstrap converter and train your writers/editors on the bootstrap basics and off you go.

But it sure did look like a fun learning exercise. As a front end dev, I'm not worried over my future and would be curious to see where it goes.

meigwilym 8 years ago |

Looking forward to this being a photoshop plugin.

toblender 8 years ago |

Now we just need it to do the javascript ...

sureaboutthis 8 years ago |

Up next, an AI for creating paintings and art and Sistine Chapel ceilings.

No. This will never be a thing. It will be copycat pages, never anything unique.

gedy 8 years ago |

The really useful AI would be to take business data or workflow and generate layout and styles to eliminate the hand made mockup stuff.