What it feels like to work with Mythos

What it feels like to work with Mythos (oneusefulthing.org)

60 points by swolpers 2 hours ago | 53 comments

eithed 17 minutes ago |

What I find fascinating is that there is so little substance in this article about the quality of produced code and the medium. Is the code documented and tested? Is it understandable and extendable? Is it secure? What language, framework, database was used? Author mentions judgement and taste - well, is the code tasteful? Will the model rearchitect the entire thing if I ask it to add new functionality, spending another 9.5h in tokens? I assume that the research part is domain knowledge = how different types of travel translate to time making it presentable.

hypfer 13 minutes ago | |

Being the first to release an article gives you great SEO or whatever. Doing the things you've mentioned takes time.

Aperocky 3 minutes ago |

> This is a map that shows the distance you can travel in a given length of time, and the first one was created in 1881 showing travel times from London.

The first item in the article, the first thing it showed, was wrong.

It was 100% faster to go from London to New York in 1881 than to Volgograd. Or any of the Russian hinterland colored green or Turkey or Egypt.

Not a good look.

JumpCrisscross 31 minutes ago |

Anecdote: I fed Fable some models I’ve been hand verifying (basically, I sketch out a scenario for Opus to model, it builds it, I ask it to show me the math, I correct it, we iterate like this, then I double check its code to make sure the math matches the model logic). Fable found almost every error I found, and then had some interesting suggestions for additional variables.

It also burned through my usage quota like a late-90s Hummer.

cyanydeez 19 minutes ago | |

Now for the best question: what's your ROI here?

PunchyHamster 11 minutes ago | | |

It will be great when the price of compute/memory drops to a normal level!

mohsen1 6 minutes ago |

I have been using it for less than an hour, so take this with a grain of salt: I am excited about the new tech.

In a project like mine (https://github.com/tsz-org/tsz) I was constantly frustrated that models were not doing enough research and were not taking other situations into account. Again and again models would produce code that would fix one thing and break 2 other tests that were "unrelated".

With Fable it seems like tasks are taking much longer (I have not seen a pull request from Fable sessions yet) but reading the transcripts of those sessions I can see how it is doing the right thing by not leaving any stone unturned.

As the article says, it's hard to communicate this "feeling" about models because it is very project specific, but I thought I'd share.

selfawareMammal 24 minutes ago |

What are people working on that they see such a substantial difference between Mythos and Opus? I'd say I'm working with advanced stuff and more often than not DeepSeek is more than enough. Why is everybody a genius in here?

ianm218 10 minutes ago | |

I’ve been working on implementing some common web infra type projects in Rust lately. Basically trying to use a lot of the great primitives in Rust like rustls (modern OpenSSL) and Tokio (async) to build memory-safe, or close to it, drop-in nginx replacements.

A small portion of this effort is having a high quality Lua in Rust repo. I’m using Mythos to fix some of the performance issues with my Lua interpreter that GPT 5.5 / Opus 4.8 had stonewalled on.

Not sure if Mythos will be able to crack this but it has been running for a couple hours now with some promising results.

Performance charts linked here if you're curious: https://github.com/ianm199/lua-rs

mohsen1 4 minutes ago | |

I had left a few of the benchmarks alone and was working on tech debt, knowing that a new model was going to be released soon. For my project (tsz.dev) Opus 4.8 was running in circles without producing results for a while on those tasks.

garciasn 12 minutes ago | |

For me, DeepSeek isn't anywhere near as useful as Opus (I haven't yet tried Fable). DeepSeek would burn more tokens, get lost or provide shit output, and would NEVER take a two sentence prompt and one-shot a workable POC to start iterating from like Opus will.

I don't consider myself a genius; but, for my workflow(s), what/how I've grown accustomed to building over the last 6-9 months, and the speed at which I'm able to produce entirely new integrated platform features, DS just isn't cutting it when compared to Anthropic's models.

mervz 18 minutes ago | |

We see the same thing when new laptops are announced and every employee all of a sudden needs to upgrade, despite the fact that 90% of people would be able to make do with a MacBook Neo.

theturtletalks 25 minutes ago |

This is what he built:

https://isochronic-passage-chart.netlify.app/

Doesn’t work too well on mobile but looks interesting

skipants 10 minutes ago | |

It looks interesting but, like a lot of AI output, it looks correct and is not. Most of northwestern Canada says you can get there by road. If you look at Google Maps, there are no roads there for quite a while. I see one highway between Inuvik and Tuktoyaktuk but that's about it.

gopalv 1 hour ago |

> It worked for nine and a half hours.

> Again, it wasn’t perfect. As an expert, I was able to spot some errors and omissions (some as a result of the design I had asked for) that I had the AI correct

That's the bit that stuck out to me - that's longer than I would expect to work on a problem in a day or even expect to go back & fix the output of something that has a core reward loop of hours.

My customers are currently clamoring to push down my agent response times from 85 seconds down to below the 20s mark.

At the same time, it is very dissonant to see the industry heading towards hour+ long workflows with an agent.

zuzululu 9 minutes ago |

> First, how good is Fable? In experiment after experiment I conducted, it outperformed basically every other public model I have used by a considerable margin.

What makes me excited is that GPT 5.6 (it's actually GPT 6) is going to be crazy

thepasch 21 minutes ago |

What it feels like to work with Fable:

> Switched to Opus 4.8: Fable 5 has safety measures that flag messages on most cybersecurity or biology topics. They may flag safe, normal content as well. These measures let us bring you Mythos-level capability in other areas sooner, and we're working to refine them. Send feedback or learn more.

recursivedoubts 1 hour ago |

Would it be possible for Mythos to make the space bar scroll the pages on your website properly?

mulr00ney 31 minutes ago | |

Seems to be hijacked by the video of some game they generated. :(

zb3 5 minutes ago |

Was the condition of being granted early access to this castrated model writing a post praising it?

382hi 2 hours ago |

I think Qwen 3.7-Plus is better at reasoning than Mythos, and I've used both for quite a while.

giancarlostoro 17 minutes ago | |

Would love to see samples of the kinds of prompts you use with both. I sometimes wonder if the specific wording is the secret sauce. I have very few issues with Opus / Claude, but when I try premier GPT models, I get weird output compared to what I've grown to expect with Claude.

asdK120 2 hours ago |

Mollick runs the Generative AI Lab at Wharton, with all the corporate sponsors.

He is a professor but sadly also an AI shill. He should switch to advertising washing powder.

MostlyStable 2 hours ago | |

So...no engagement with the substance? Not even to explain why it is that this is not a useful description or test of capabilities? Ok.

dthread3 1 hour ago | | |

I would like to see it do something useful, like converting pytorch to golang.

CuriouslyC 10 minutes ago | |

Ethan is a booster but I wouldn't call him a shill. He cites data and mostly in a fair way, though you could argue the sources he chooses to focus on are biased.

whyenot 1 hour ago | |

Instead of attacking the author, please respond to the content of the article. That is the HN way, and it leads to more substantive and interesting discussions.

root_axis 2 hours ago |

I just can't stand this type of fawning language.

the_doctah 1 hour ago |

More Mythos Marketing.

boringg 27 minutes ago | |

The mythos of Mythos is marketing.

et-al 2 hours ago |

[flagged]

astrange 1 hour ago | |

It is not a sponsored article and he writes one of these every time a new model releases. Why would a professor at Wharton need to write sponsored Substack articles?

0x1ceb00da 1 hour ago | |

"I don't care who the IRS sends I am not paying taxes!"