Margaret Atwood Reviews a Margaret Atwood Story by AI

Margaret Atwood Reviews a Margaret Atwood Story by AI(thewalrus.ca)

93 points by goldenskye 2 years ago | 72 comments

danenania 2 years ago |

I wonder which model was used for this? Based on the poem taking "10 seconds" to generate, I'd guess the free version of ChatGPT, meaning 3.5 turbo.

While I wouldn't expect Atwood's conclusions to change too much by using GPT-4 instead, I think it's interesting that even the majority of educated people and journalists outside of tech don't seem to realize that the best model is at least 10x smarter than the free version of ChatGPT, which is what they seem to be using for all their prejudice-confirming "experiments".

They also always seem to assume that if the output from whatever prompt they came up with can't reach X quality bar, that means it can't be reached by anyone else either with a different prompting strategy.

Not trying to throw any shade toward Ms. Atwood, who is one of my favorite writers, and I'm also not claiming AI will be writing as well as her anytime soon... just pointing out that if we want to really measure where we're at on tasks like this one, a more rigorous approach is needed.

muglug 2 years ago | |

> the best model is at least 10x smarter than the free version of ChatGPT

Citation needed. What does 10x smarter mean here? There’s an ongoing debate about whether the word “smart” even applies to a text prediction engine.

datameta 2 years ago | | |

My gut metric says it's a ~20% increase in perceived interpretation and output complexity, whatever that means exactly. But there are plenty of eval result aggregators out there.

darkerside 2 years ago | |

I hear this a lot. I didn't notice a huge difference in quality with GPT4. Completely anecdotal, and could have been a failure to effectively prompt for that model. But I don't think it's safe to assume the results are 10x improvement.

CSMastermind 2 years ago | | |

I have. I don't propose some kind of scientific measure but I do have two data points to contribute:

First, I've been using GPT to build an application for work for the past few months and anything but GPT-4 consistently produces less consistent and reliable output. Things like occasionally producing malformed JSON.

Second, I have a set of questions I use to evaluate models testing different capabilities and GPT-4 does much better than other models, particularly at coding tasks. There are some exceptions, for example, Bard has been able to do better on stating facts sometimes and Claude has done better at summarizing long text.

I'd love to have another model as good as GPT-4 to use but I haven't found one yet.

rsynnott 2 years ago | |

> think it's interesting that even the majority of educated people and journalists outside of tech don't seem to realize that the best model is at least 10x smarter than the free version of ChatGPT

I mean... the content-free drivel they generate is more _polished_, possibly, though I'm not sure this is actually an improvement. What do you mean by 'smarter', here?

armchairhacker 2 years ago |

I think a while ago I commented something along the lines of "let me know when we see a successful book/article/speech which gets revealed to be largely AI-generated". And that hasn't happened yet: AI-generated content has always been noticeable and generally considered bad ("was X written by ChatGPT?" is an insult).

But I know AI is already being used to assist human writers, not just with boring emails and speeches, but creative works like articles and books.

Moreover, if AI ends up writing something decent, it won't be recognized as AI-written. And the human "author" probably won't be quick to reveal so; due to the controversy surrounding AI, and because then people would over-scrutinize it and just point out mistakes which even a human would make (or really minor opinionated things they call mistakes just to have a point).

Going back though, if AI ever does get to the point where it can replicate human talent, eventually we're going to know. If GPT-5 exists and is able to replicate human-quality writing, it's only a matter of time before someone reveals it, or a competitor catches up and then they reveal it.

famouswaffles 2 years ago | |

LLMs can hit human quality writing just fine (not professional yet). The top LLMs today are all deliberately trained to sound bland, robotic and uninspired with rlhf etc. It's just the default voice, not some weakness of LLMs and it's not very hard to make them not sound like that.

loudandskittish 2 years ago | |

Indeed, I've come across more instances of things that were supposedly AI-generated that turned out to have been made by a human being...

dools 2 years ago |

The Weeping Willows of Winnipeg is a shit story, but if you were working on a short story, and you got 5 suggested rewrites for a given paragraph, or you were looking for ideas for a plot point or something similar, then you could use ChatGPT to help you out.

In exactly the same way, sometimes I give ChatGPT a complete coding task and it can't do the job. But while I'm working on code I can get it to do certain things and it saves me a lot of time and sometimes comes up with very useful insights and things I was unaware of.

I'm sure authors (or anyone else whose job maps to "language processing") can use it similarly.

tkgally 2 years ago | |

My current job maps in that direction--translation and writing. Lately I have been using GPT-4 to produce first drafts. It saves me time and effort and gives me ideas for expressions that I wouldn't have thought of on my own. It's also good for writing in genres that are not my strongest. I wrote a PR brochure recently, and GPT-4's drafts had more advertising punch than what I usually produce.

I still spend a lot of time polishing the final version, so the time savings are only about twenty percent.

passion__desire 2 years ago | |

There is a story of a author who used to cutout words from newspaper and randomly arrange them in bogosort manner and find good combination of words.

At one time he found, "son sues father" which came true as a headline few years later.

inciampati 2 years ago | |

It's most powerful as a tool for transforming and projecting text with text. It also excels at lateral exploration, enumeration, comparative critique. But yeah, don't ask it to write you a story and expect a gem.

boplicity 2 years ago |

Reading this, I realize that the "prompt" is what's lacking, in terms of the output produced. A writer has a prompt too -- which is created through the entirety of their experience leading up to the moment they sit down to write, often heavily weighted towards the hours leading up to that moment. AI will catch up to actual human writers when it can spend a day's visual, auditory, and linguistic stimuli crafting the prompt that leads to moment of creation.

inciampati 2 years ago | |

Beautifully said. I can add only that the right pattern of prompts may drive the creation of legitimately useful and even beautiful work. Humans need to guide the machines, maybe precisely because they lack the evolutionary drive that gets us up in the morning and keeps us cleaning our babies' behinds.

m3kw9 2 years ago | |

More like a constant chain of prompts, except the prompts are exponentially more data and each output is combined with more sensory inputs for another inference and they happen in real time.

garba_dlm 2 years ago | |

it's not a prompt, it's a dialog, it's many prompts.

FrustratedMonky 2 years ago | | |

Yes When I have a conversation with GPT4, it keeps the history, and the entire history is a prompt for the next prompt.

I would say, not totally unlike a human memory. just GPT4 is limited to 8000 characters or something. Humans have a bit more, but with added 'forgetfulness', so who knows how many characters for real. Humans don't actually recall perfect text very well.

braydenm 2 years ago |

It's unclear what model was used for the outputs mentioned in the article, but here's the short story that Claude gave me for the same prompt: --- Here is a dystopian short story in the style of Margaret Atwood set in Canada:

The Snows Came Early That Year

The snows came early that year, blanketing the prairies in white before the harvest was complete. I awoke to howling winds rattling the panes of my solitary cabin, the radio readout blinking ominously, batteries too weak to check the weather reports again. I shuffled outside in layers of wool and coveralls, squinting against the icy gusts, to find the chicken coop door ripped clean off. Feathers and blood painted the snow crimson. The four hens I had left were gone, with only scattered tracks and droplets leading into the tree line to indicate the predator's escape.

I cursed under my frosted breath, the bitter wind stealing away my words. Food had already been scarce, each winter harder than the one before. I shuttered the coop and turned back inside just as the power flickered and went out. Now there was no heat, no light, no way to cook. The solar panels would be useless under the heavy snows.

As I sat wrapped in furs, watching my breath turn to plumes of smoke in the gloom, I made a plan to walk to town when the blizzard passed. Perhaps barter some handicrafts for a few days of rations. The trek would be risky, but staying here held a different kind of peril. The food stocks in my locked pantry were dangerously low. There was no choice but to go.

The Katskil wolves would likely be on the prowl, their numbers multiplying as the animal population declined. Stay to the road, move quick, keep head down. Don't attract attention. Don't look too frail, too thin, too cold. The robotic guards at the checkpoints would be indifferent to my starvation, but opportunistic thieves might not. This was the world we inhabited now. Hold fast to what is yours. Trust no one. Every day another fight to survive. My grandmother's voice echoed in my mind...from her days before the collapse: "Don't lose hope, girl. It's always darkest before the dawn." But that was in another era, now lost to us. There were no more dawns ahead, only the howling dark.

mariusor 2 years ago | |

I'm not sure if you're trying to say that this output is better, but in my opinion it is very much not.

I don't think that someone that wrote "you fit into me", would use the abundance of qualificatives that this text has. Furthermore there's nothing of Atwood's style that I can detect in it. The only interesting bit for me is the inverted assumption that the voice that speaks to us is male, but I feel like even that's too lame of a device for something a veteran author would do.

> you fit into me

> like a hook into an eye

> a fish hook

> an open eye

michaelmrose 2 years ago |

I don't know anything about poetry but I do know you have to put some effort into a prompt and iterate to get anything useful out.

Here are some interesting results from literally feeding this article and prompting chatgpt to improve the poem and take into account her critique. They seem at least to my eyes a little better.

===========================================================

    Upon these shores, where time seems still,
    PIBO stands, guided by a passioned will.
    With hands ungloved, their touch sincere,
    They embrace nature, drawing it near.

    Loving straightforwardly, without pretense or guise,
    In every bird's song, a part of them lies.
    In this dance, where boundaries fade and intertwine,
    Your journey, dear bird, feels much like mine.

    In the quiet of Pelee, where heartbeats merge with the sea's hum,
    There lies a tale of dedication, of hands and feathers becoming one.

===========================================================

    Upon Pelee's shores, where the migratory journey starts,
    PIBO stands undeterred, with gentle, gloveless hands and hearts.
    No insect repellant, for it might be toxic, they bear,
    Yet with every sting and bite, their dedication's rare.

    Warblers greet the dawn, singing their age-old song,
    Yet amidst them, mosquitoes and chiggers throng.
    With delicate touch, they band, note, and release,
    Suffering in silence, so our joy in birdsong doesn't cease.

    In the hum of nature and the dance of skies so grand,
    Echoes the profound mission, of PIBO's ever-watchful band.

ACow_Adonis 2 years ago | |

It reads like a someone tried to commit poetry by rhyming dot points in a list. Except they got a discount bulk-rate on purchasing commas at the grammar store so they stitched it all together with 50 of them and called it done :P

Plus like allmost all of its output, it appears to have the amount of profundity, aesthetic pleasure, insight and interest usually found in corporate boardrooms and marketing brochures. That is to say, vanilla dross.

jfengel 2 years ago | |

It's still doggerel. The rhymes are forced and the syntax stilted. The praise is vapid and lacks any sort of insight.

It reads like a college fight song. I'm good with replacing college fight song writers with a computer.

hyperman1 2 years ago |

After reading the end of this, I have the feeling I should read some classroom chatGPT's version of 'childrens stories in the style of H.P.Lovecraft'. Halloween is near anyway.

abledon 2 years ago |

> So sleep well tonight, dear authors. Your vocation is safe from the pod people. At least for now.

hahah "Pod People"... is that what SV residents are now?

soulofmischief 2 years ago | |

It is a callback to the first paragraph in the article, where Atwood makes a reference to the movie Invasion of the Body Snatchers, in which an alien race breeds clones meant to replace people in society.

https://www.imdb.com/title/tt0077745/

userbinator 2 years ago | |

Not that far off from the truth: https://news.ycombinator.com/item?id=37638563

jaza 2 years ago |

I'm quite impressed with "The Weeping Willows of Winnipeg", I'm happy to commend ChatGPT on it more than Atwood does. Sure, Atwood is an acclaimed professional, but hey, I've written dystopian short stories worse than that! Although mine probably involved less plagiarism than ChatGPT's.

dools 2 years ago | |

> I've written dystopian short stories worse than that!

Is your name Roger? Sub-question: do you have any nieces or nephews?

jaza 2 years ago | | |

Sorry. No and no.

rossdavidh 2 years ago | |

I found myself unable to read the whole thing. I'm not sure exactly why, but it was a tedious task to keep reading it, so I skipped past the last half to Atwood's commentary.

hoseja 2 years ago |

Wait, she's being serious about the criticism? It actually seems tongue-in-cheek to me, those are perfectly serviceable bits of text that are an existential threat to mediocre wordslingers the world around.

esafak 2 years ago |

See also Douglas Hofstadter's review of a GPT4 essay prompted to be written in his voice.

https://www.theatlantic.com/ideas/archive/2023/07/godel-esch...

flenserboy 2 years ago |

Are there any turnkey engines designed to run locally which can be trained on your own data? I've been itching to put my work into one, just to see what the results might be.

HPsquared 2 years ago | |

Llama 2 and various derivatives as the model. Get quantized models from https://huggingface.co/TheBloke

Oobabooga text-generation-webui for the server.

In the interface, use ExLlama for GPU inference (fast; for smaller models which fit in VRAM). Llama.cpp for large models (higher fidelity but slower), CPU+GPU.

13B parameter 4-bit quantized model (type 'GPTQ") can fit in a 12GB RTX 3060. 24GB card (e.g. a 3090) needed for 30B model on GPU. Something like 5-10 tokens/sec.

Can run 65 or 70B parameter models on CPU (e.g i7 12700) with 64GB RAM (also need decent GPU as above). Around 1 token/sec. These models are type "GGML" / "GGUF".

Long prompts take a long time for initial ingestion on CPU+GPU, much faster on GPU only.

Llama.cpp also apparently runs very well on Apple silicon, with the shared memory between CPU and GPU being well-suited.

flenserboy 2 years ago | | |

Much appreciated!

starkparker 2 years ago |

So can someone post a children's story written by Anaïs Nin, as requested? Should be fun

strken 2 years ago |

It's funny reading articles where the author is directing smug superiority at an ML model. "Haha, Toyota Camry! You may be able to move faster than I can run, but can you move up a ladder?"

galkk 2 years ago |

That criticism reminds me articles/videos where professional musicians critise some famous hit songs. Musicians sometimes can comment that guitar is ahead of drums or vocal doesn't hit notes quite well. The things that I as not musician doesn't hear at all. And this is also common topic in their reviews, that one should write music for audience, not for other musicians.

I'm not native speaker and many weirdnesses of the text may go past me, but I can say that for me the commented texts (especially the 2nd one, about post apocalyptic Canada) are completely passable and much better that what I will be able ever to write.

Yes, it may be not a threat (yet) to professional, especially established author. But they will be good helpers for people like me, who can get suggestions, improvements and illustrations just for the price of my 4090 and time to tinker with models.

No gpt was used for writing this though.

huytersd 2 years ago |

That short story was well written.

NateEag 2 years ago | |

What did you enjoy about reading it?

What was the plot?

Which characters leapt off the page, so real you could swear you met them last year?