Stop donating your customers' data to Google Analytics

Stop donating your customers' data to Google Analytics(dev.to)

433 points by bakztfuture 6 years ago | 156 comments

lbrito 6 years ago |

So the solution is to use Amazon's tracker?

I appreciate the problem and I would like to stop using GA in my static pages as well, but trading one privately-owned software from a tech giant for another privately-owned software from a different tech giant seems a bit ludicrous. I would readily swap GA for some decent open-source solution though.

falcolas 6 years ago | |

I'd say the following to this (very reasonable) argument against using AWS: AWS makes money by selling services, not collecting data. Should Amazon make the leap and start harvesting data from AWS for marketing purposes, the data from their analytics platform will be the least of our worries.

Thus far, AWS has proven to be safe for companies to host their data upon, and there have been no leaks of data stored in AWS into Amazon's marketing program. The HIPPA, PCI, and FedRamp certifications help back up their claims that a company's AWS data stays in AWS.

Nullabillity 6 years ago | | |

People said the same about Apple, and then it turned out to be bullshit.[0] And, big surprise, their customers did not seem to hold them accountable.

So in the end, paying for stuff just shows that you're a more valuable product to sell. And gives them a great primary key to track you by.

[0]: https://news.ycombinator.com/item?id=22106536

adreamingsoul 6 years ago | | |

Or is that wishful thinking? AWS and Amazon are in the buisness of collecting, storing, and processing data.

thrav 6 years ago | | |

This is why I use Prime photos, instead of Google photos. I need one, so I’ll take the paid one.

jka 6 years ago | | |

Over the long term it's possible that maximizing shareholder value requires monetization of additional resources.

The approach of 'today, company X is known to make money via Y, so we can trust them with private data' only works until that data becomes valuable enough for the company to invest in extracting.

akie 6 years ago | |

I'm only aware of https://matomo.org/ (formerly Piwik) as a good open source alternative to Google Analytics. Are there others?

jka 6 years ago | | |

I can vouch for Countly ( https://github.com/countly ) which is open source, supports a few different platforms (web, iOS, Android), and has a nice administrative web interface.

The web SDK also supports collection of client-side JavaScript errors, which is neat for tracking down bugs and things which might harm user experience.

eldridgea 6 years ago | | |

I'm a fan of Fathom[0]. The amount of data and insight is light compared to GA and others but if it meets your needs it's pretty great.

[0] https://github.com/usefathom/fathom

lgrebe 6 years ago | |

Check out https://simpleanalytics.com/

clarry 6 years ago | |

> I would like to stop using GA in my static pages

Why don't you? Do you really need this tracking at all?

exotree 6 years ago | | |

Depends on the job, purpose of site, and goals being accomplished. Hard to demonstrate value and improve digital business outcomes without GA (and other tools like HEAP Analytics).

C1sc0cat 6 years ago | |

And the freebie GA doesn't limit you to a 25% sample across the board basically the bigger the date range and amount of sessions the more sampling you get.

Id be interested to see how amazons et up would handle the set up I am backup analytics nerd for.

Major beverage brand hundreds of websites, multiple locales per site (a dozen or more for the big brands) on and has to handle roll up as well as well as custom metrics and dimensions.

lern_too_spel 6 years ago | |

In this case, AWS is contractually obliged not to use your data. My criticism of the article is that the author mentions that users are blocking access to third party trackers, but that means AWS Pinpoint set up in the way he suggests will get blocked as well. People who have that concern have to expose Pinpoint endpoints on their own domains or implement some other first party tracking solution.

pixelbath 6 years ago |

I see a lot of suggestions for free or open-source analytics packages, but I would refrain from recommending anything you haven't personally used.

I've tried to separate myself from Google in various ways, and one of those was to replace Google Analytics with open source software. I tried several; they're all either non-functional out of the box, or require significant time investment to even start approaching Google Analytics.

After losing about a month of stats (which matters when you're also running AdSense), I ended up going back to Google. It took the same amount of time to set up as when I initially set it up: around 2 minutes of adding the tracking code and uploading it.

AndrewStephens 6 years ago |

The headline is wrong, it should be changed to "Stop donating your customers' data to Google Analytics ... donate to another large corporation instead!"

There are much better options out there. Quite apart from the solutions listed in these comments, a better option is to reconsider whether you really need analytics at all. Maybe the answer is yes if you are a business trying to understand your customers. But not every blog and project page needs analytics.

runninganyways 6 years ago | |

Or you could write the 10 lines of JavaScript that'll do what 99% of people use Google analytics for

mateus1 6 years ago | | |

If you think you can replicate that with 10 lines you have no idea what Google Analytics is used for.

bibobap 6 years ago | | |

Feel free to post those lines

Avamander 6 years ago | | |

This really reminds me of the infamous "you can just use FTP instead of Dropbox" comment.

ptasci67 6 years ago |

A bit tangential but a quick click on the author's name in the article and their bio reads:

> ex-Amazon contractor, front-end lover, accessibility nerd, down for building cool shit, especially Vue.js and Amplify.js consulting

My alarm bells ring when the answer to "stop using X" is to "start using Y" where Y == company I worked for.

This isn't to say GA is or isn't problematic, but the article's bias is problematic.

coldcode 6 years ago |

It would be handy if people listed here all the alternatives that don't steal your customer info.

dempedempe 6 years ago |

This seems like more of a promotion for AWS Pinpoint than a criticism of GA.

benbristow 6 years ago |

Interesting that the page breaks if you're using Adblock because of Google Analytics being in the URL.

Shows me a fun 'You are not connected to the internet' page that lets you doodle on the page.

marcosdumay 6 years ago | |

Not on Firefox with uBlock Origin.

davidarkemp2 6 years ago | | |

I'd check your filters - EasyPrivacy has -google-analytics- blocked in the URL

I got to it by adding the following filter

    @@||dev.to/goatandsheep/stop-donating-your-customers-data-to-google-analytics-191?i=i$xhr,1p

benbristow 6 years ago | | |

Using Edge (Chromium) with uBlock Origin

tmikaeld 6 years ago |

The problem is usually competing with "free" and Google knows this, there are privacy respecting alternatives like https://www.visitor-analytics.io though.

II2II 6 years ago |

> Tracker blockers are increasing in popularity so consumers can protect themselves against this tracking, reducing the effectiveness of your analytics.

More to the point: there is probably going to be a bias in the analytics. Different people have different reasons for protecting themselves against tracking, but it is highly unlikely that people who are unaware of or disinterested in the issue will use a blocker.

unreal37 6 years ago |

"My competitors tracking solution is ridiculous. You should get your head examined if you use it. You should use mine instead."

Terrible argument.

modo_mario 6 years ago | |

You seems to have missed the arguments made tho. You get to avoid the cookies and as someone else pointed out Amazon doesn't use the data. It's your data.

tzury 6 years ago |

Did not read through, but from a quick look, I suspect anyone can grab the code, and fill in your AWS with terabytes of garbage data which will end up in an enromous amount of dollars in AWS billing.

Am I missing something?

choward 6 years ago | |

Machine learning. How can you say no to machine learning? Did I mention machine learning?

JosIJntema 6 years ago |

Interesting topic. This among others is one of the reasons we started building Harvest. Just as with Google Analytics, you can start tracking data with just a small snippet of Javascript.

We use Splunk as our data engine and you can install it on your own server. This way you have full control, access and ownership of your data without letting third parties get any data. In that sense Harvest is basically the infrastructure that allows you to collect, store, use and visualize your data.

Besides that, we have been focusing on features that will help companies comply with privacy regulations. It is proven that this is not always easy in the complex world of online data.

For more information check https://harvest.graindata.com/en.

snowwrestler 6 years ago |

The suggested Google Analytics implementation today is a collection of three separate Google technologies: the original GA, Doubleclick cookies to track demographics and interest, and Tag Manager to manage them.

The original GA does not give Google useful cross-site user data because it uses only first-party cookies and anonymizes data as it collected it. To my knowledge you can still implement GA this way If you want to. Such an implementation would be GDPR compliant in not tracking any personal data, although your counsel might still say you need to list them as “analytics” cookies in a cookie banner (mine did).

gpvos 6 years ago |

I didn't know about AWS Pinpoint before, but from what I can see, it only offers analytics for email and other messages, not for web pages, so presenting it as a full alternative for Google Analytics is misleading.

rchaud 6 years ago | |

The article doesn't even seem to mention anything for which GA is nearly indispensable. E-commerce analytics, conversion funnel visualization, customer segmentation, etc.

unreal37 6 years ago | |

The author wrote a tool to make it do that I guess.

projproj 6 years ago |

I have fun looking at these stats (sites with Google tracking vs. sites without) in a Firefox addon I made: https://bitbucket.org/tayler/google-spy/src/master/, https://addons.mozilla.org/en-US/firefox/addon/googley-eyes/

IdontRememberIt 6 years ago |

While running my first business GA was not really usefull (We used internal tools easier to integrate in the code and adapt to our needs).

However GA data showed its usefullness when selling the business. The data was considered as a trusted source of information for the buyer. And all the definitions (unique user, etc) were aligned with the buyer's, so it was easier for them to assess the metrics.

StreamBright 6 years ago |

I have created a simple workflow using AWS Lambda + Kinesis + S3 to track our customers and not to have any 3rd party dependency. It took roughly 2 weeks but it is worth it since do not leak customer data and we have much tighter control over what we collect (no PII except the source ip that gets hashed in the process).

ElFitz 6 years ago | |

FYI, if your setup relies on API Gateway, you probably could use VTL / Mapping Templates to directly send from API Gateway to Kinesis and skip the lambda altogether, like some do for dynamodb

See https://hackernoon.com/serverless-and-lambdaless-scalable-cr... And https://aws.amazon.com/blogs/compute/using-amazon-api-gatewa...

StreamBright 6 years ago | | |

Woo thanks! I did not know it. I might re-architect the workflow to have this.

macinjosh 6 years ago | |

> I have created a simple workflow using AWS Lambda + Kinesis + S3 to track our customers and not to have any 3rd party dependency.

Except for each of the 3 components you listed that make up your system. They are 3rd party dependencies,

whoisjuan 6 years ago | | |

Everything is a 3rd party dependency then. The only way to not have a 3rd party dependency is to build your own infrastructure and use open source solutions (and even with OS you're still dependent).

I think OP was clearly referring to a self-managed solution as opposed to a set of 3rd party services like GA, Segment, etc, where the flow of data is out for your control.

StreamBright 6 years ago | | |

I meant no 3rd party dependency on storing customer data that requires extra legal work in GDPR land. Maybe we need to include AWS in that though. I need to look into it how cloud vendors are 3rd party in that sense. Is there a difference between Google Analytics vs. storing data on S3 even if we do not collect PII?

gpvos 6 years ago |

http://archive.is/RAXU6

digitalengineer 6 years ago |

Why not spin your own? This tool comes with a lot of tools out of the box and can also run personalization techniques and more: https://harvest.graindata.com/en/store

fretn 6 years ago |

a few weeks ago my blood needed a checkup. They sent me the results by mail. The results where on a non password protected but 'unguessable' url. And the page ofcourse contained google analytics, I'm in the EU, I wonder if this is legal

prox 6 years ago | |

If they haven’t notified you, and hence you can’t/didn’t comply, it probably isn’t. Especially medical companies are scrutinized for following gdpr. You could make a case here to either the companies privacy officer, or your countries privacy watchdog.

https://gdpr-info.eu/art-39-gdpr/

pbalau 6 years ago |

You really have 2 choices:

Are you relying only on data you can get from your app? There is no reason not to build your own solution.

Are you relying on data you can't get from your app/website? Then you can only use GA, since FB does not have a service like this.

mariushn 6 years ago |

The main issue is that competition is basically cut out due to the free pricing of GA.

Very few businesses/people would choose to pay for something when GA is free. Why do that? To tell your customers "we value your privacy"?

leokennis 6 years ago | |

Don’t you think that, in today’s climate, customer knowledge and involvement about online tracking, fingerprinting, filter bubbles etc. is at an all time high? And that makes this the best time ever to indeed be proud to tell your customers that “we value your privacy”.

It’s one of Apple’s strongest marketing pillars.

brainlessdev 6 years ago |

Related: the discussion about Goatcounter https://news.ycombinator.com/item?id=22044854

sdan 6 years ago |

I'm currently building a free analytics service that's the fastest. Ever. Faster than Fathom, Simple Analaytics, pretty much everything except Google Analytics you can think of.

https://sdan.io/pingpong

Still building it, but you can sign up for when it launches here: https://forms.gle/MhojBWWfdiWjZatC7 (I know it's ironically on google forms and I'll move away soon)

> https://sdan.io/pingpong

october_sky 6 years ago | |

> I'm currently building a free analytics service that's the fastest. Ever.

How do you intend to make money from this free service?

sdan 6 years ago | | |

If you're getting over a million hits I might add some incentive to donate, but mainly I take this as my payback to the developer community. I'm launching another product in a different domain at the moment and hoping that can compensate.

At the end of the day, if anything goes wrong, I'll always be happy to open source the whole thing.

jammygit 6 years ago |

I have used simpleanalytics for a while. It offers a lot less granular information because it deliberately collects less for privacy reasons

oarabbus_ 6 years ago |

The problem is GA is orders of magnitude better than the competition for price to performance ratio.

drusepth 6 years ago |

Is there a free alternative that has feature parity with GA and isn't very difficult to set up?

snambi 6 years ago |

Yeah right, stop using GA and start using Amazon analytics. Very good suggestion.

aforty 6 years ago |

And firebase analytics is much the same as GA right? In fact I think it’s even viewed on the GA portal.

nrjames 6 years ago | |

Connecting Firebase to Google Analytics is optional, for now.

shadowgovt 6 years ago |

> And if you think that's okay, you should take your head out of the sand because consumers are demanding it. Please tell me how many of your users like the large cookie agreement popups that they have to dismiss...I-I mean read and accept just to consume your content. Agreements that you're forced to have them agree to because you're using cookie-based trackers like GA.

I think that's the heart of why I so despise the GDPR. In an intent to change site behavior, politicians passed a law putting a burden on sites that did an undesirable thing (rather than, say, making the undesirable thing itself illegal).

Perhaps they thought sites would avoid the burden.

Did they not anticipate full shifting of burden onto end-users? Because being able to know how a site is used is extremely valuable to the site's owners.

djsumdog 6 years ago |

I tried Matomo (Piwik) recently, but I only do log analysis and it doesn't really treat log access as a first class citizen. If you use Javascript tracking, it's probably the right way to go.

I switched back to AWStats for my personal stuff. It's probably too basic for business or company apps, but for your personal stuff without javascript/cookies, it's still a great analytics tool.

nicky0 6 years ago |

Anybody ever heard of server logs?