Trading Program Ran Amok, With No ‘Off’ Switch

Trading Program Ran Amok, With No ‘Off’ Switch(dealbook.nytimes.com)

143 points by mindblink 13 years ago | 92 comments

SagelyGuru 13 years ago |

'Prediction is difficult, especially with regard to the future'.

It looks like they were using some new algorithm, which should have made them a lot of money, had the market gone up after their massive purchases. In that case, they would have pocketed fat bonuses and would not be on the news.

However, it has not happened, so the crying and the search for a scapegoat is on. It sounds like the case of the banking business as usual: 'heads I win, tails you lose'.

Ultimately, there is a really serious problem with the concept of limited personal liability for companies engaging in speculation. It is an assymetric arrangement, whereby the directors are entitled to the profits but are never personally responsible for the losses. With such rules of the game, it is advantageous to take crazy risks. Expect to see a lot more of this and many more taxpayer funded bailouts.

yummyfajitas 13 years ago | |

Huh? Knight Capital lost a bunch of money, and will likely go bankrupt. Essentially, Knight's software bug transferred a bunch of money from Knight to everyone else. This poses minimal systematic risk to anyone else, and they will almost certainly get no bailout.

The markets have already recovered. The S&P was down a little bit on thurs and recovered by friday. Knight is down 60%.

http://www.google.com/finance?q=INDEXSP%3A.INX%2C+NYSE%3AKCG

This is ultimately a situation of the market being a robust and stable dynamical system.

white_devil 13 years ago | | |

> This is ultimately a situation of the market being a robust and stable dynamical system.

Not while it's being shaped by algorithms competing against each other, which you're a part of.

gkuan 13 years ago | | |

Yes, no risk to others unless you happen to be one of their newly acquired futures broker unit's customers with $411 million in deposits. http://www.reuters.com/article/2012/08/02/knightcapital-regu...

iskander 13 years ago | |

They weren't taking long-term speculative positions, their short-term market making had a major bug in which instead of buying low and selling high it was doing the opposite (thus losing a small amount on each pair of trades). For more details read Nanex's blog (http://www.nanex.net/aqck2/3522.html).

So, they launched a busted market making algorithm, lost a ton of money and no one is going to bail them out.

jonah 13 years ago | | |

And the nanex guess at why it happened: http://www.nanex.net/aqck2/3525.html

justincormack 13 years ago | |

They were not taking big positions expecting the market to go up, they appear to largely have been burning money buying and selling fast.

There is no government money involved.

SagelyGuru 13 years ago | |

Of course, buying high and selling low is always the 'reason' for making a loss. In this case, I think the program caught itself out by manipulating the market, which it perhaps naively assumed to be non-manipulable.

In other words, it was creating so much volume that, when buying (or selling), it made the market go up (or down). It was then reading the price as going up (or down) and jumping on its own bandwagon. This, of itself, would create growing oscillations in the market and growing losses.

For this to work for you, you need to first create a trend and then sit back and let the suckers pile in on it and take the losses. You then return only when you want to reverse the trend again, at a profitable level (for you). I suspect the program was just too fast for its own good and not a match for the human Masters of this art.

beagle3 13 years ago | | |

That's not what happened, according to nanex.

It just kept making markets in reverse (instead of joining the bid and the offer, it bid on the offer and offered on the bid).

Nanex speculates that Knight ran their tester software on the real market (the tester losing money on purpose to the main algorithm). Alternatively, it could simply be a bug that sends a bid instead of an offer and vice versa. One bit flip at the wrong place could cause that.

ams6110 13 years ago | | |

Guess their developers never watched "Trading Places"....

"Wilson!!! Get back in there and SELLLLLLLL"

svdad 13 years ago |

What I wonder, following this story this week, is how the software quality controls at a place like Knight compare with those for life-critical systems like those in, e.g., aviation.

On one hand, you'd think the QA in finance would be pretty solid, considering that the survival of the company could be at stake (witness Knight). On the other hand, I have a feeling that even there, people just don't take it that seriously.

Would love to hear from anyone with more experience writing software for these industries.

nmcfarl 13 years ago |

They lost $440 million (and amount greater than their market cap), and possibly the company, on what the world knows to be incompetence.

At some point if I couldn’t stop it - I’d be tempted to just kill the power to the server rooms, all of them. There just has to be a way to cut your losses.

bagosm 13 years ago |

So, some of the owners were looking for a way out, and magically this thing broke loose and started giving away (basically) free money to undisclosed receipients. In the meantime all the technicians were fast asleep and couldnt kick the machines down or something, while they were losing milions of dollars per minute. This article is a completely honest recap by completely honest people, about completely honest traders/bankers (bankers are not people).

Edit: on a COMPLETELY unrelated note, trading firms/banks are known to actively pursue the extraction of money from their clients with bogus trades/advice http://www.nytimes.com/2012/03/14/opinion/why-i-am-leaving-g...

fsckin 13 years ago |

I would rarely suggest this, but if something is so incredibly broken that you're loosing money at a rate of 800 million dollars per hour, screw the customers.

Turn it off at any cost. If you are forthcoming and transparent, customers will understand.

pheon 13 years ago | |

Point is you dont know what the loss is.

1) They bought too much stock (incorrectly)

2) realized WTF, stopped everything

2a) more likely their clients said WTF is wrong first

3) had to sell the stock for the rest of the day.

Its only after they sold everything did the $440MM price tag surface. Hopefully they sold most of their positions to goldman (instead free market) so one of their investors made a boatload of cash.. giving them favorable terms for a line of credit.

sp332 13 years ago | | |

This is wrong. The algorithm was buying and selling constantly, sometimes losing small amounts of money (usually about $15) each time, sometimes as often as 20-40 times per second for each of about 150 symbols.

wtracy 13 years ago | |

Another thread here has a link to a blog post with (admittedly speculative) evidence that, at two different times, they tried rebooting the system only to have it come back up and start making random trades again.

retube 13 years ago |

There seems to be a lot of confusion around market making, brokering, execution algorithms and HFT in this thread.

teyc 13 years ago |

I read the nanex article. Regardless whether it is true or not, the general trend is towards development of more sophisticated load testing programs.

The most benign ones were developed for use in IT systems. E.g. Apache bench. While these can cause disruptions if aimed at production services, this does not necessarily threaten the health of an entire enterprise.

However, the trend is that all software sectors are starting to adopt this particular technique of testing software with not sufficient regard to what happens if it is released into live systems.

For example, we have chaos monkey, from Netflix, which randomly shutdown services in a cloud based system.

What would happen if software which simulated meltdown at a nuclear facility was accidentally bundled into the build system by a tired operator? Or some one does the same with flight software?

The main software running trading platforms would presumably be supervised by another program to ensure that bad algorithms do not lose e company too much money. However there was no such tool for the component that generated the test data.

To me, it sounds like the supervision should be done at a higher level, e.g. A wrapper around existing APIs. All software running against live systems must call into the wrapper.

Secondly, test software should conduct some kind of verification. E.g. Check for evidence that it is testing against a Test system. This might be the presence of a nonexistent company, et c.

I am more than happy to compile any other ideas you may have so that the IT industry is able to build more fail safes into software.

We are starting to see some of these fail safes in practice. E.g. When you try to send out an email to everyone in the organization, email software may warn you if you are sure you want to do that. The problem is we haven't thought enough about these scenarios that we don't adequately address them.

Incidentally, over in Australia, the Commonwealth Bank suffered a major downtime when it's outsourcer HP accidentally pushed out system wide updates instead of doing this to select machines as originally intended.

sanxiyn 13 years ago |

Pure speculation. Maybe there was an off switch, which used to work, but not regularly tested, and silently broken? Wouldn't surprise me.

CaveTech 13 years ago | |

Highly doubt it.

alexchamberlain 13 years ago | |

Sounds highly likely!

sahilz79 13 years ago |

This was apparently an infrastructure problem of some sort: http://www.bloomberg.com/video/tom-joyce-knight-is-open-for-...

Infrastructure changes can be notoriously difficult to back out by simply using an "off" switch, particularly if this was some type of a firmware upgrade that impacted all of their production servers. Backing it out at a minimum would require some type of a reboot, which would cause problems with an active trades. It could very well be that they were running an Active-Active environment, they had to go Active-Passive, back out the changes from the passive environment, reboot, and surgically cut over to the passive environment. This could easily take 30 minutes.

elmarks 13 years ago |

Doesn't this mean that others made a killing, taking advantage of all the mispriced orders?

jonah 13 years ago | |

Most likely. c.f. the recent JPMorgan losses[1].

[1] http://www.pbs.org/newshour/businessdesk/2012/06/who-benefit...

jonah 13 years ago |

"Knight is also working with Goldman Sachs to help unwind the trades behind its extensive loss, according to people briefed on the matter.

"Goldman has agreed to buy, at a discount, the shares that the trading firm had accumulated. Such a move would help Knight by taking the portfolio off its hands and freeing up capital."

What does this mean? Why would GS do this? Why would Knight do this? Couldn't they just sell them on the open market at a better price instead?

jkimmel 13 years ago | |

Yes and no. Given the kind of volume Knight purchased during the faulty trades, it could be difficult to offload that many shares on the open market in a timely manner. Maybe those stocks are hot Monday, maybe they're not. Knight needs capital yesterday to keep floating, so they're likely looking to sell everything in one basket.

As for GS's motivation, they're buying at a discount. Due to the time sensitive nature of Knight's predicament, they're probably trading the portfolio to Goldman at a reduced rate. Unlike Knight, Goldman has the cash to sit on it for a while and sell the shares directly out into the open market, even if it takes a few days. Given the discount they bought the shares at, they're likely selling with a decent margin.

jonah 13 years ago | | |

So GS is primarily the go-to since they're willing to leisurely unload stock they bought at a discount and they've got the cash in hand to do so. Makes sense.

robryan 13 years ago | | |

Would be interesting if Goldman also made a stack off the original trades.

teekarja 13 years ago |

Strange article. Lots of text but missing the main thing I was looking for. What kind "erroneus trades"? where did the money go? If you buy stock at the market you did not intend to buy, why not just sell them the next day?

iskander 13 years ago | |

You can get more technical details from nanex: http://www.nanex.net/aqck2/3522.html

Essentially, they were buying high and selling low. Many times a second.

jonah 13 years ago | |

Seems like maybe they couldn't hold on to the stock for long enough to unload the enormous volume they were dealing with. It sounded like at one point they were doing AS MUCH VOLUME AS EVERYONE ELSE on the exchange combined.

http://news.ycombinator.com/item?id=4337750

Devilboy 13 years ago | | |

Since there's 2 parties to every trade doesn't that make 50% the limit?

noselasd 13 years ago | |

Because the program sold the stock it bought immediately, at a loss. Leaving you with nothing to sell the next day.

0x0 13 years ago |

Why isn't automated high-frequency trading banned already?

Does it not go directly against the spirit and purpose of having a stock market with proper investors?

retube 13 years ago | |

this is issue nothing to do with speculative trading. Knight is a broker. They provide an interface to the market for retail brokers, spread betting outfits and so on. Their algos execute orders placed by their clients. Their new algo had a bug. That's it.

0x0 13 years ago | | |

I don't know the details of retail brokers vs HFT, but this writeup http://www.nanex.net/aqck2/3522.html has a lot of charts showing trades with 25 millisecond intervals.

Just looking at things in a big perspective, the fact that the system is designed for allowing trades at such frequencies makes it seem like markets these days no longer exist for the benefit of the listed companies.

Then again, maybe I don't know wtf I'm talking about :-S I guess I don't understand how real value can be created from such a system.

reddiric 13 years ago |

Can someone smarter than me please explain why a system where trades got matched / executed at a granularity of once per second or once per several seconds wouldn't work? What would be the problem with exchanges accumulating and keeping secret buy and sell orders and executing them at a reasonable interval?

sgt101 13 years ago |

I wonder if the lack of a kill switch was linked to the power black outs in India - what if they set it going, got cut (power) and then only came back online 45 mins later? Could be that just one link - say a local exchange or power for an FTTP line failed.

tlogan 13 years ago |

I don't believe the problem was caused by "bug" - this reminds me of "rouge trader" stories.

It is kinda weird that all problems in Wall Street are caused by "bugs in software" or "rouge traders": while executives are never hold accountable.

SeanDav 13 years ago |

This is an extreme example of what what can happen when what should be a software company thinks it is some other sort of company. I am sure they thought they were a trading company and software development was the necessary evil required to get things done.

Well 400 million odd dollars in the red later I doubt they still feel that.

I do feel sorry for them and they probably didn't deserve this huge loss. Hopefully valuable lessons can be learned.

brokenparser 13 years ago |

Can anyone provide some context on this matter? What happened wednesday and where?

davvid 13 years ago | |

http://blogs.wsj.com/marketbeat/2012/08/02/knight-capital-tr...

Basically, they deployed a new HFT algo and it started buying high and selling low. oops!