Do data-driven companies win?(benn.substack.com) |
Do data-driven companies win?(benn.substack.com) |
My first take-away was that using data to make decisions is tremendously, tremendously powerful. A/B tests, in particular, can help determine causality and drive any key metric you want in the direction you want to. Short-term, it seems to work great.
Long-term, it fails. Being purely data-driven without good intuition and long-term bets (that can't be "proven" with data), and the product loses its soul. You can (and should) invest in metrics that are more indicative of the long-term. And you should use data to help guide and improve your intuition.
But data is not a substitute for good judgment, or for a deep understanding of your users and their problems, or of "where the puck is going". It's just a tool. It's a very powerful tool, but if it's your main or only tool, you will lose.
I wonder if A/B testing is a bit like that. You're suddenly listening to every single issue encountered by users, but lack the wisdom to understand which issues to ignore. As a result, software changes constantly, users cannot really learn a GUI because it changes so frequently, and software teams feel like they're constantly improving things, however the software itself is not actually getting better when a the whole user's experience is taken into consideration.
We know how to optimize for things like the shortest distance between two points but we don't understand how to optimize Software Design or even GUI.
This is literally what "Design" means. If something needs to be designed it means there's no theory or notion behind it on what it means to be "optimal." So we "design" a "better" solution but we don't actually know if it's better. So we design another solution and the cycle continues.
A/B testing when done on a population that gives consistent answers should converge on a consistent solution. If the population gives different answers at different times then of course there will be churn.
I would say the methodology of A/B testing is indeed like machine learning, it's quite good and accurate. It's the data source that's the problem. If you have users that don't know what they want or behave differently and inconsistently then your conclusions reflect the data. Perhaps the data is accurate and there is no consistent conclusion, OR the data is inaccurate and you need to get it from a better source other then users telling you what they think is better.
If you don't know how to properly construct experiments and interpret data it's honestly not much better than hiring an ornithomancer to spot omens in the skies. To be clear, this is something that even scientists with decades of both education and experience sometimes get wrong. If you're coming at it with an MBA I've got some concerns.
The entire problem here is that when people say "data-driven", what they actually mean is "model blind". If you make decisions based on a model, nobody will say it's data-driven, it doesn't matter how empirical the model is. Yet, if you rephrase the question as "Do model blind companies win?" the answer is obvious.
Running incoherent experiments testing for each little event you may think of is certainly better than walking at random. But as everybody knows for centuries, you use data to improve your model, and use your model to improve your craft. Jumping over the model part is an incredibly lousy use of data.
Data is useful for proving my decisions once I have made them, so what's most useful is having short feedback loops so can I catch bad decisions quickly, or reinforce good decisions.
I can't figure out what this means, but I have a feeling it's the exact opposite way you should be incorporating data into your decision making.
Generally, you can only generate data and do experiments based on your present state.
This sounds like post-hoc, anti-intellectual rationalization.
How do you place long-term bets without a model to measure expectations versus outcomes? How do you know what good intuition is? There is a ton of research on "gut" calls that demonstrates it's random.
>But data is not a substitute for good judgment
Good judgement requires data.
How is this anti-intellectual?
If you want, I can formalize as a game the problem of choosing business/product strategy in a competitive market with a continuous flow of imperfect information. I can then use ideas from controls to establish some upper bounds on what can be inferred from a continuous flow of information. I can then use that result to prove an impossibility result about the game. I can even tweak assumptions to get bounds on probability distributions which infer we'd be better off flipping a coin or whatever.
I'm not going to do the work, because intuition is almost always enough to identify these situations, but it's absolutely clear to me that results like this obviously exist and correspond to many real-world situations.
> Good judgement requires data.
It used to be that insisting on data-driven decision making was a hard pull. Now it's the opposite. Insisting on data where data cannot possibly provide enough signal to make a decision is the new form of anti-intellectualism. IMO.
I would say good judgement requires experience. Data may or may not be available and applicable, but its absence doesn't mean one can't exhibit good judgement.
Surely the data comes after the action, not prior to it?
Data is valuable. Some KPI's can show important, even vital information -- often not directly but through the first or second derivative. If something changes rapidly, you probably want to take notice.
Overall though, I think the metrics that we have available are about as helpful to a company as the metrics we can track about our bodies are to our health. Sure, monitor your weight, HRV or even your blood sugar levels and get some insight. Let it help inform your decisions. But only that.
On top of that I see companies with employees who didn't understand college mathematics but now want to be data driven - E.g. basing major decisions on 10 customer feedback data points. Or who don't know the difference between median and average.
E.g. if the metrics of customer acquisition vs customer LTV don't make sense on a fundamental level, it doesn't matter what your other metrics are showing.
the thirst of some people when I give them something that sounds like what they want. When they re-parrot last years findings back to me incorrectly because they only listened to the parts that backed up their instinctual beliefs. Makes me angry I waste my time on the data when all they wanted was an excuse. I coulda given them an excuse without having to do all that data digging...
I think fundamentally this suffers from the problem lots of VCs have right now. Cheap money meant that the correct investment strategy was to just pile money into high risk bets. I think this calculation looks very different without free money.
It will give a short term boost, but eventually you will have to look beyond your data and explore new things your customers really want.
The parable of the drunk found looking for his keys under the light instead of inside the dark bar where he lost them comes to mind. What's needed is a flashlight, however imperfect.
It's a bit like moving the furniture around the same room.
Someone has to have multiple good ideas and the ability to carry them out properly before an A/B test is even valuable. Otherwise you're measuring one uninformed random change against another. A/B confirms whether you've succeeded in improving something. It can't really suggest what to try next.
The truth is that data is only powerful if applied judiciously with solid knowledge of statistical fundamentals and careful thinking about causality (which will may not be practically falsifiable!). "Data" can also be misread and applied in big powerpoints with a reckless disregard for reality.
Enlightenment is understanding enough about data to use it correctly, but also acknowledging its limitations and that to create a successful business you also need vision and insight about potentials and trends for which accurate data does not and can not exist (except as a post-hoc trailing indicator).
The terms derived from philosophy are the most snobbish in my opinion. Additionally Logic and science formalize these concepts, Epistemology on the other hand has a lot of qualitative mumbo jumbo, with concepts like "belief."
Of course if data-driven means short-term with bad KPIs this version of data-driven will fail. I think a lot of people picture this A/B short-term testing when they think of it.
A data-driven approach is still far from objective and relies on choosing good metrics to optimize for.
To this end I would recommend the book "How to measure anything" from Douglas W. Hubbard (https://www.goodreads.com/en/book/show/444653.How_to_Measure...)
While I understand your point, it also depends on what you mean by "better". It's certainly safer, less risk, more predictable and makes planning easier. It also limits creativity and high pay offs.
Some one else commented that it was like "driving by looking only at the dashboard". You can do that very safely in planes, but you're limited in where you can go. I feel it's the same in business.
There is the option of you having an absolutely massive dataset available. It just seems a bit far fetch to assume that any company would have data that could allow them to move from developing a SaaS product to running a chain of burger joints, because the data indicates that would be a good move.
I think this fully depends on the metric that you choose. The metrics can be very creative.
> could allow them to move from developing a SaaS product to running a chain of burger joints, because the data indicates that would be a good move.
In this scenario the counterfactual is a company without data is taking the decision based on a hunch.
I think we are saying similar things. For me any decision that is rational is based on data (better or worse data, but _something_ is quantified). What and how to quantify from the real world to incentivise the right behaviour is the creative part, from my point of view.
Why would being data-driven limit either of these things?
Using data for decision making is more relevant for more mature processes where the data exists to learn from. Being data driven is especially important for data intensive sectors where the complexity of issues cannot be fathomed just by being brilliant, it's here you need to be able to analyse the intricacies using the data to find possible answers.
I just published a blog in response to this article explaining my thoughts on it - https://terminusdb.com/blog/some-data-driven-companies-actua...
Certainly works for us, but I don’t think this is how most people understand “data-driven” anymore.
That's to say that real innovation is being more data-inspired. Taking risks with what is known based on your intuition/opinion.
A timeless read on this topic is Clayton Christensen's "Innovators Dilemma".
I think the exact same phenomena applies to investing too. You can sustain your growth by making moves purely off data, but your big wins may come from being "data-inspired"
This blog somewhat covers these ideas in more detail: https://amplitude.com/blog/data-driven-data-informed-data-in...
The irony in this all is that nobody can tell the future and therefore what we know in the current moment is the best thing we've got. Even those with more experience, resources, and knowledge can be disrupted. That's the beauty of the world.
Is it about making money? Or is it about about using your money to gain greater control? Or some combination of both?
This article assumes its all about the money. But if - as I believe - we are under some hybrid corporate-governance system, where these 2 domains are working together whilst appearing to be separate, then the measure is not just money.
Data driven companies tend to use the data to quickly fill in dark spots, collect that income and tap the source dry, and then bureaucracy and nepotism set in and the script is flipped. Data is used to explain away why we lost our shirt and how it’s not our fault.
The VC is looking for the moon and trying to argue that more data would somehow make the VC feel less like it’s Las Vegas. Sorry it’s all Vegas unless you want to spread and settle for making few percent by investing in all 5 companies and not trying to guess the pack leader.
I've been in a company that claimed to be "data driven". There was plenty of data that wasn't a mess - and we used it to make a lot of decisions. But, of course, we discovered new uses for our data - and that had to be cleaned.
The distinction is that all data starts messy, but you clean some, then discover new dirt. So you clean that, then discover some more. Some companies have more clean data than others.
Bingo. And judging by many of the comments here, it's spoiled people's opinions on what it really means to be "data driven". I mean, imagine advocating for "good judgement" when so much empirical research says it's not repeatable?
Bad execs will find a way to manipulate any good philosophy toward their control
It's not realistic to have a dataset large enough to cover all situations, especially not those areas and opportunities that you don't know exist. That's also way Googles 20% projects was such a good idea. It allowed to business to grow and develop in directions they couldn't possible predict.
Prove (?), v. t.
1. To try or to ascertain by an experiment, or by a test or standard; to test; as, to prove the strength of gunpowder or of ordnance; to prove the contents of a vessel by a standard measure.
Thou hast proved mine heart. Ps. xvii. 3.
(Webster's 1913)
Which makes sense to me. Try something you think might work, then test it—is just plain old experimental science.
LBJ didn't want to "lose" Vietnam. LBJ remembered Joseph McCarthy who accused the Truman administration of losing China to communism.
JFK and Ngo Dinh Diem were murdered in 63, Gulf of Tonkin incident in 64, and full US armed intervention by 65. This was a rollercoaster ride that the US military didn't plan for and didn't want.
McNamara picked Westmoreland to lead the military effort. It was a disaster. Their lack of a coherent strategy was doomed from the start. The body counts and #bullets per enemy killed was akin to rearranging the deck chairs on a sinking Titanic. On top of their minds was the Korean War, when the US military operated at China's border and China intervened militarily. Hence, US military land operations was limited to South Vietnam. There were no plans to invade North Vietnam.
Nixon and Kissinger decided the end game was to breakup the USSR-China alliance. With China on the US' side, the USA no longer had a national interest in what happens to Vietnam. The US left Vietnam in 73 and the country was overrun two years later.
Here are some dubious assumptions:
* There is an optimal answer.
* Human preferences are transitive (if I prefer A over B and B over C, then I prefer A over C).
* Human preferences are internally consistent.
* Human preferences are stable over time.
Apply these assumptions to finding an answer to the question of what you should make for dinner, and you'll quickly see there are problems.
> question of what you should make for dinner, and you'll
> quickly see there are problems.
And, to amplify your point a bit, there are problems with A/B testing for which of my children I should love the most, or what I find beautiful or interesting; or should I A/B test out whether enslaving others works out well for me?
I chose extreme examples, but the point is that there are many, many things in human experience that don't lend themselves to easy, simplistic rules. It's genuinly hard to work through a lot of a lot of the issues that people face in real life.
With that said, it's it's also important to work through and to understand objective data as best as possible. For clearly defined and nicely-behaving problems objective data is certainly the way to go. The problem is that a lot of the problems people actually face aren't so easy to understand, and aren't so well-behaved.