Home Social Media Issues Are Analytics Services Sharing Your Personal Browsing History?

Are Analytics Services Sharing Your Personal Browsing History?

August 27, 2010

A huge part of social media measurement involves studying the Web analytics of your blog (or your company’s blog). There are numerous free and subscription-based services available that will tell you all kinds of things from how your traffic compares to the competition to what search terms people are using to find your blog. Compete.com is one such analytics service.

I think my issues with faulty metrics are fairly well documented. I know to take all data from these types of sites with a grain of salt. After all, samples can lead to educated guesses at best; terribly inaccurate data at worst. And I’ll give Compete some credit for getting at least some information (close to) correct about some of my sites — generally their unique visitor estimates aren’t too far off for me.

I take issue with other data the company is providing though — data that I consider to be a violation of my privacy, and data you might be unknowingly sharing as well. This post is about sharing something I recently discovered occurring on Compete.com, as well as my opinions on the ethics and defenses of it.

Unusual Referrals and What They Told Me

Periodically I run simple site comparisons through Compete’s free tool to see general trends — usually my primary blog and competitors / colleagues in the niche. It’s a good way to see if overall the niche is seeing increases in readership or how my blog is faring compared to others. And that’s fine.

But when I ran a search in July, I noticed a section in the results called “top destination sites.” These are sites people are visiting after they visit my blog. And some of them didn’t make sense. I knew for a fact that my blog was not referring traffic to some of these “top” sites shown. I checked my internal stats. And I was right. One site on the list was another one I owned. I checked the internal stats on that too. And indeed there was no traffic going from Blog A to Blog B.

Compete.com Top Destination Sites Report (free version)

OK. Then hold on. How can these sites be showing up as referred traffic? I realized something — these weren’t sites that my readers were being referred to from my blog. They were other sites I personally visited frequently. They were sites I visited directly via type-in traffic — not by clicking a link.

Note: My primary blog is set as my homepage in the browsers I use because I check the admin area frequently during the week. Therefore that page is naturally displaying when the browser first opens, and before I type in any other website URL into my navigation bar.

Well, hold on again. Why the heck does Compete know what I’m privately typing into my Web browser’s navigation bar, when that information should be between me and the site I’m directly visiting? Let’s just say I was “not happy” when I realized what was going on.

Essentially, here’s what we had:

My personal type-in browsing history was showing up for all the world to see via Compete’s free search tool (not just to paying members).
Anyone who knows me at all could come to the logical conclusion that these sites were my own browsing history and not that of my general readership (making it personally identifiable).
Sites that I might not want associated with said blog were showing as being “referred” by my blog — which implies some level of support or connection even when none existed. Even where a connection did exist (my ownership of two sites), that connection was not one of Site A referring anyone to Site B. The two sites should not have been connected in any way publicly in these stats.

I’m a big stickler for privacy. I read terms and conditions far more often than your average Joe. When companies specifically ask if they can collect information about what I’m doing on my computer (usually under the guise of reporting anonymous data back to them so they know how their software or whatever is performing), I say “no.” I’ve also yet to see any terms and conditions specifically grant a company permission to sell my type-in browsing history for public reporting in any way that could be even remotely personally identifiable.

Well, I contacted Compete before writing this article, and I had a chance to speak with Compete.com’s Director of Product Management, Eric Austrew. And it’s clear that we have very different views and definitions of what things like “referrals” and “invasion of privacy” are. That’s fine. As Austrew himself said, reasonable people can disagree. Of course we can. Then again, when it’s my personal information being shared publicly I have to be blunt and say I really don’t care how much a company wants to defend their definitions. I’d say I took it far easier on Austrew than I wanted to (if you remember my work at NakedPR.com, you know what I mean), but I do have to give him some credit and thanks for hearing me out given how passionate I still was about the issue.

That said, you might remember that I used to run a small PR firm. You might also know that I made it a point to set myself apart from the traditional sleazy PR image of hype and spin and constant “corporate speak.” I despise corporate speak — with a passion. When yes / no questions are overanalyzed or avoided (we’ll get to that), I start twitching. When someone I’m interviewing is constantly going out of their way to seemingly “stay on message,” I’m sickened by what usually comes across to me as a lack of authenticity that seems to hang in the air. I have little tolerance for it, and this was no exception. But we’ll get to that too.

What I want to do now is open up a few areas of discussion with readers here about this issue and what can be done to protect people’s privacy. Let’s get to it.

Referrals: Let’s Define

What exactly does it mean to say that you “refer” someone to something? To me it means that either you’re recommending it or you’re directly sending someone to something (in this case a website). It implies knowingly directing someone to a resource, whether that be in support of it or not (such as me linking and “referring” you to Compete’s site in this article regardless of actual support for the service).

Compete’s definition of “referral” seems quite different than its generally accepted use (and having been a webmaster for quite a few years, and having worked heavily with them through my former PR firm, I can tell you that the term “referral” does not generally include what someone happened to be looking at before they became a direct visitor via type-in traffic).

Austrew acknowledged that Compete defines “referral” differently than I do. And again, that’s fine. But I consider it less fine to use confusing terminology that can lead visitors to infer untruths, as in a site or its owner actually referring people to sites they wouldn’t dare refer. And it’s definitely confusing.

According to Austrew, even though type-in traffic is shown in the free sampling of “top destination sites,” it’s broken down further for paying members with a better explanation of what that data means. Again, fine. But what’s not fine is not breaking that down for every single visitor who has access to any of this data using Compete.com. Telling some people what something means but not others is irresponsible at best (in my opinion).

The phrase Austrew used for this traffic was “true referrals.” In other words, a true referral in Compete’s view is any site you were viewing when you left to go to another site. He used the example of BestBuy.com and CircuitCity.com. And from a corporate perspective I can see how someone might find it valuable to know a direct visitor was on a competitor’s site first, and that they then chose to visit you instead. But Compete doesn’t only report on corporate sites.

Also, “true referral” is hardly an industry standard phrase, and personally I consider it highly misleading to refer to something that could be complete happenstance as a “referral” of any sort. In fact, “true referral” is a term that’s been used to refer to multiple things including:

Actual relationships in referral networking (as opposed to referrals from people who know you but who have no actual experience in business with you);
Original referrals in Web traffic (the first site that directed you somewhere and exposed you to another site as opposed to later links you might have followed there).

So let me ask you. If you were doing research on a competitor’s or colleague’s site, and you looked up their “referral analytics,” would you get the impression that this was traffic actually referred from their site to somewhere else? Knowing that isn’t always the case now, if you visited a small blog from someone you know do you think you could probably pull out personally identifiable browsing histories, separating it from the relevant referred reader traffic? How do you feel about it?

Invasion of Privacy: What is (or Should be) Private?

I noted that Austrew and I also seem to have different views on what constitutes an invasion of privacy. While I do understand that Compete claims all traffic is opt-in, I personally do not consider it “opt-in” when it comes to personally identifiable information — no matter how rare of a case mine might be. To the best of my knowledge I’ve not once clicked an opt-in box agreeing to let anyone share potentially personally identifiable information. I’d actually consider opt-in boxes in general to be far too rare these days. Instead so much is tucked into big blanket terms & conditions or terms of use statements littered with legalese, which companies know damn well most consumers don’t read (or is it that they can’t understand it?). Companies know precisely what they’re doing in this sense, and it’s how they get you to agree to just about whatever they want, and you might not be any the wiser. (Talking about the 3rd party services here.) Even then though, there’s a reasonable expectation of complete anonymity when data is sold or otherwise given away — several sets of terms I’ve read since yesterday have blatantly said as much. And to me, a lack of complete anonymity when that data is transferred and / or published is an invasion of my privacy.

But I asked what I think was a reasonable question. I asked if Compete had a way for me to opt out. They did not. I was informed that to opt out of the data mining and selling / sharing, I would have to opt out through the 3rd party I had a relationship with. So I asked how I could find that 3rd party (I mean, seriously, how many services are we all a part of these days?). But Compete won’t reveal the identities of their partners, so there’s no easy way for me to say “Okay, I’m a member of X, Y, and Z, so let me go find out how to opt out with them or cancel the service.” I guess it’s nice that at least somebody’s privacy is protected that thoroughly. For me though, it feels nothing short of trapping and is reminiscent of Facebook previously making it incredibly difficult for people to delete their accounts once there — after all, the value is in keeping the information, right?

The Spam Comparison

Now take a moment and think about this in comparison to spam. Essentially you’re more protected from spammers legally than you’re protected from companies publishing this kind of information.

It starts in the same place sometimes. You sign up for a service. You give them your email address for legitimate uses. Somewhere in the terms that you have to agree to if you want to use that service it states the company is allowed to sell or rent your email address. Sometimes there’s a special check box regarding allowing “partners” to contact you, although again this seems to be far too rare.

That email address is included in a rented email list. Someone purchases that list and they email you, promoting their products and services. So far, all by the book. Here’s the thing though. They have to give you a way to opt out. And no, I don’t mean the original service provider. I mean the people who purchased the data and proceeded to email you. If they don’t give you a way to opt out, or if you opt out and they continue to contact you with those marketing messages, that’s spam.

So tell me…. Why should equally personal information like browsing history be held to lower standards? Why aren’t the 3rd party users of this data required by law to let you opt out of that data collection and publication if you do find that you’re one of the exceptions to the rule? And I mean opting out through them — not trying to track down a third party, when it’s highly unlikely the terms blatantly say “hey, we’re the ones providing your information to Compete.com.” It’s my opinion that they should be. What about you?

On a side note, after first being told there was nothing Compete could do to help me opt out or discover the third party that apparently led to an opt-in status, Austrew did offer to help in an email. I plan to take him up on that offer, and I’ll update here in the comments if / when that happens.

Where is this Information Coming From?

This is really my big question in this whole issue — who did I sign up with that led to an opt-in so I can now opt-out, given that information displaying in my site records is indeed personally identifiable to people who know me. Compete obviously can’t tell me who their partners are. But I had a thought — “hey, my browser tracks this information, so I wonder if it’s in the terms I agreed to when I installed it.”

Actually, I use three browsers. In Web development you don’t honestly have an option — you have to use all major browsers for testing. But the data that displayed in July would have been from June. I mostly used Firefox then, with a bit of Chrome tossed in. So I checked their privacy information. Firefox’s privacy policy is confusing beyond all hell, and that’s coming from someone who usually has a pretty fair grasp on the legalese. To their credit, the language is actually remarkably clear. It’s the information that isn’t. Early on it sounds like your information will stay within the Firefox / Mozilla community, and later on they say they can provide your personal information to certain third parties. So what is it? Is my information private, or is Firefox providing it to others? If so, who?

So hey… why not just ask Compete’s representative outright if they partner with any Web browser software companies? Well, I did.

When I spoke to Austrew on the phone, I asked this question. I mean there are only three main browsers for PC users. If one or more is selling this kind of browsing history information, we should know about it. The answer I received was somewhat vague, but basically amounted to it being an unlikely situation. Of course that’s not exactly a firm “no” either.

After that conversation I had another thought — what about our ISPs? Again, most Americans have limited options in this sense. Would we have to cancel our Internet service to “opt out” of this kind of data sharing and get around those terms of use? So I asked that question too, and re-questioned Austrew about the browser issue so there would be no chance of mis-quoting him on that.

Here are the questions I asked:

1. Does Compete partner with any ISPs? (As in, to opt out someone might have to go so far as to cancel or switch their Internet service provider to get away from those terms and conditions.)

2. Does Compete partner with any Web browser software company where the simple choice to use a browser could result in opting in to sharing data with Compete? (As in, the use of browser X means you’ve opted in to having type-in data shared — not situations where Compete might partner with the developing company in some other way, given that Google, Microsoft, etc. clearly have business beyond browsers.)

And here was his response:

To answer your questions, our panel consists of multiple sub-panels – basically, different recruiting sources – that that fall into one of the following categories: proprietary panelists that we recruit, and clickstream data that we license from ISPs or desktop application partners. We follow industry best practices for ensuring that the resulting clickstream is anonymous.

So yes, your ISP might be tracking your type-in browsing history and selling off that data to Compete and similar companies. I can tell you how unhappy that makes me, but I’m sure you can already figure that out. I’m even less happy about it given that I’m not actually the person who signed up with this ISP — so your spouse, or roommate, or someone else involved might have agreed to something that tracks your personal data without you even realizing it.

As for browsers? Well, I still haven’t seen a “no.” But they’re not even directly referenced in that response. And they are “desktop applications.” So I still don’t know what exactly to tell you on that front. That concerns me.

As for the comment on best practices and anonymity, that’s all well and good but….

Sometimes “Anonymous” Data Isn’t Truly Anonymous

Here’s the thing. It doesn’t matter in the slightest if your browsing history data is anonymous to Compete. Or to your ISP (or whoever else is collecting the data). It doesn’t matter if things are aggregated. What matters is if the people researching your site can identify your browsing history through the data provided there. Actually, if even one person can do that, the information is no longer technically anonymous.

There are other potential issues too. What if the information reflects your website referring people to a site that’s completely and utterly inappropriate? What if you’re a teacher who blogs, and your blog were to show that it refers traffic to a racist hate site? You might never have gone to said site. You might never have referred to that site in any way. But a malicious user could repeatedly open your site, then directly type in that site’s address, making the public record show that your site refers traffic to them. For a huge site with a ton of traffic, they might not be able to influence anything. But what about for a smaller blog? The exact wording at Compete.com is “websites getting traffic from [insert your site here].” That statement is not technically accurate. That site would not be getting traffic from yours. It would be getting direct type-in traffic that just happens to be in the same browser window as the site previously viewed. And anyone who’s been in the Web development game for a while knows there’s a big difference.

referral — Referral Language on Top Destination Page (free version)

People can be malicious. If you’ve ever been under serious fire from a troll or competitor, you know how it can be (*raises hand*). You have a right, in my opinion, to not have your site falsely associated with things that could be viewed as some sort of relationship by an average user, which could come across as defamatory if implying any level of support.

Now is that likely to happen? I don’t believe that for a second. But I went ahead and ran a few searches on blogs in the freelance writing niche (the niche of my primary blog). I looked up information on competitors, colleagues, and friends. The simple truth is that when we’re talking about professional blogs of individuals, those three groups have a lot of overlap. You know a good deal about the people you’re researching, and it makes information that much more potentially personally identifiable.

In checking just a handful of sites, I found some interesting results, and I asked the site owners about them:

One blog was tied to a black hat community. For those not familiar with the term, it has negative connotation — meaning someone who blatantly ignores or tries to get around the rules to manipulate things like traffic stats or search engine rankings. This professional writes for online business owners, and an association with that kind of community could cause them to lose business. This wasn’t just a top destination listed — it was the only destination listed. In this case the writer also is in a profession (in addition to writing) where they have to go out of their way to “keep their nose clean” when it comes to being associated with anything potentially questionable. In this case I could do a simple search and discover that nowhere did their site link to this community. They do not visit that site, and knowing this person fairly well I wouldn’t assume that they do (and it’s not such an awful thing that I’d care if they did). That leaves a couple of other possibilities: 1. The individual’s partner works on the Web as well and could have visited the site frequently. Or 2. Traffic was coming the other way around, and “back button usage” or re-typing the URL to return to the community could explain it. I’m just hypothesizing here.
In another case I was able to very easily identify a frequently-visited Web comic of a colleague. In this case fortunately there was nothing inappropriate that would have led to a negative association. But it is still none of my business what they are reading, and I should not be able to easily determine that by viewing freely available “top destination” data on Compete’s website. More importantly, this isn’t just a colleague. The individual also works for me. I’m a client. Had this research shown something highly inappropriate, there’s always the chance it could have affected the working relationship. I think the worst you’d probably find in my own browsing history would be an online dating site or two from that phase a few months back. While I’m not embarrassed by that in general, I can say I’d rather my colleagues and clients not have been able to look me up there when a profile was still active. Those things are kept separate from work for a reason. Would you want clients knowing what you frequently look at online?
In another example, I looked up a writer whose site was showing that a torrent site was a heavily referred destination. They didn’t even seem to know what a torrent was up front. And while there’s nothing inherently wrong with torrents, many of us do know that they’re often used in copyright infringement scenarios, giving them a negative image — especially in an industry where copyright infringement is a particularly serious issue. There were no links from the writer’s site to this torrent sharing service. How it ended up as a top destination site is still a mystery in that case, but it’s another example of how a not-so-“true referral” risks the image of the supposedly-referring site when it’s being researched by colleagues, clients, or others in the industry.
In still another example, I looked up a site run for military families. A top destination was a gaming site — again, no direct referrals from said site to the gaming site. Now you’re talking about a family-oriented site and gaming — and I’m sure you know the ultra-violent reputation a lot of military-centric games have. In this case the culprit seems to be a contextual advertising network. The site owner now knows to try to ban certain types of ads from displaying on the site. However, as anyone who’s ever run a site with contextual advertising networks can tell you, you don’t generally get to hand-pick your ads. When you see something offensive, you can usually block the advertiser. But it would be extremely difficult, if even possible, for you to know every single ad that appears on your site. It varies not only by page, but by page load. And ad inventories frequently change. And given that the javascript ads are designed specifically so that the traffic “referral” is not counted by search engines as a credited link for rankings, it would be a logical assumption that they’re not being tracked as referrals in general. Not the case apparently when the page is viewed after a pageview on your own site.

That’s what I found in just a few minutes of searching. Eight sites were looked up. Five showed strange results. Four of those people got back to me. And one of those four had personally identifiable information show up (just as anyone who knew me at all could have come to that conclusion about my own site’s results).

Is it happening constantly? No. It’s not. But two is two too many. (Wow, say that three times fast.) And keep in mind I’m not talking about corporate sites here. Many blog communities are fairly tight-knit like ours is. Colleagues get to know each other fairly well. And sometimes those relationships aren’t the most friendly. There is a lot of information that could be easily inferred, and shared by people who should not have it. We aren’t talking about nameless, faceless corporate sites, but ones knowingly owned by individuals where the people who need to research those sites are the very same people who likely know the owners.

There are many types of sites you might not want showing up there as referrals (when they’re indeed not referred to by your site at all). It could be a porn site. Maybe an extremist political site. Perhaps you visit a religious site and you don’t think your faith should be of concern to people visiting your blog. Or it could just be another website you own. Many people, myself included, own numerous sites that we don’t publicly associate our names with (and no, no adult-oriented or shady sites here — sorry to disappoint). The fact that you visit these sites directly is no one’s business just because they’re checking stats on an unrelated site. Of course, that’s just my opinion because I was one of the people with identifiable information being displayed.

Protecting Your Private Information Online

I’ve always considered myself fairly careful about online privacy. I’m very calculating in what I choose to share and keep to myself. There’s a difference between sharing personal information in a post with my readers and giving up something like browsing history which I would consider more “private” than just personal. But somehow I got pulled onto this panel without realizing it (and hopefully they’ll be able to help me leave it). What can you do though?

Run searches for your own blogs and sites to make sure nothing potentially personal is being displayed.
If you’re on the panel and you know how you signed up and you don’t want this browsing behavior shared, then opt out through the third party service you used.
Don’t save one of your websites as your browser’s homepage. Yes, I think it sucks that you’d have to change your browsing preferences to maintain what I believe should have always been private or at least unidentifiable but if it helps, it helps.
Make it a point to open brand new tabs or browser windows, or at least visit relevant and / or completely “innocent” sites if you’re starting out with your own site in your browser window (less likely Yahoo! will show your destination as a top destination than your own much smaller site for example — fewer destination sites means a greater likelihood something will show as a top destination). I make no promises on the tab bit though — perhaps that’s enough to circumvent the issue, and perhaps it will still be tracked if your site was the last visited while the browser was open.
I suppose you could also just never visit any kind of website you wouldn’t want your friends, family, colleagues, employer, or clients to know you’ve visited. But I really don’t think that’s a solution.

There’s one last thing I want to touch on, and that is the fact that Compete does not currently allow website owners to opt out of having their sites’ statistics measured and shared. Personally, I find that unacceptable. I do understand that they make their bread and butter by selling data. And the reasoning I was given was that if Compete lets one site owner opt out they have to make the option available to everyone. To that I say, yes, they should indeed make that option available to everyone. After all, if even Google lets you opt out of being indexed and therefore having information shared (like incoming links or information searchable via site-specific searches), then there is little excuse for that option not to exist in this case.

I don’t believe for one second that Compete or anyone working for the company is intentionally sharing personally identifiable information about any blogger or website owner. But the sad truth is that even if rare, it does happen. And I don’t know about you, but I would very much like to see that change. And if it does not, then I’d like to see Compete make opting out of reporting an option for site owners, especially if they do find their own browsing history is being exposed in some way.

Thanks for bearing with me for what turned out to be more of a short e-book than a blog post here. And thank you again to Eric Austrew for taking the time to talk to me, whether or not we’ll ever fully see eye to eye.

If you want to check your own blogs or websites to see what kind of referral traffic is being displayed and associated with your site, visit Compete.com. You can also learn more about Compete’s panel if you want to know more about where their data comes from.

13 COMMENTS

Gail August 28, 2010 at 5:11 am

I know you are too wise to do not realize that sites that continually ask for more “secret questions” to be answered under the guise of “password recovery” are only thinly veiled excuses to build every larger profiles on us.

That they routinely ask for the names of distant relatives, where you grew up, where you went to school, etc. is of even greater concern because the main reason someone would want to know about obscure relatives is to find you if you choose not to be found.

I hope that your post here and that revelation will get people to really start thinking about why someone is asking the questions they’re asking and what other purposes it might serve. And don’t even get me started on the total naivete of allowing any site or device to publicly post your location.

One run-in with an overzealous admirer or a dangerous ex will cure you of that because some of them can only be eliminated from your life if you’re willing to make yourself impossible to locate – far easier said than done!

Did you see the latest about Facebook allowing OTHER PEOPLE to “check you in”? Some bloggers have wisely documented how-to opt out of that automatic behavior. (Naturally the default is to allow that – and how many Facebook users realize that? Even if you can’t imagine any reason you don’t want someone to reveal where you really are, consider some of the consequences of someone “revealing” where you are not.

Reply
- Jennifer Mattern August 30, 2010 at 10:10 am
  
  Fortunately most give you the option of what to answer these days, and there are often options that really aren’t personal at all. For example, “what is your favorite restaurant” or “what was the name of your first pet.” So it’s smart to choose the ones that don’t reflect anything deeply personal. Or, you can always lie. There’s nothing saying you have to give accurate answers. They’re just used for password hints / recovery most of the time, so all that matters is that you choose an answer you’ll actually remember, and one that other people couldn’t guess.
  
  And yeah, I mentioned the issue of Facebook letting other people check you in on another post I actually just wrote for the site (not live yet). I don’t care if it defaults to your friends. It should default to being disabled. End of story on that one. Your friends would know far more about where you are than strangers anyway, so there really is no protection on that front. If anything they’re giving access to people who could share the most about you. I find that disgusting. Then again, privacy is one of the big reasons I stayed away from Facebook from the get-go, and haven’t regretted that decision yet.
  
  Reply
Andy @ FirstFound August 31, 2010 at 10:51 am

That’s pretty sinister. It’s bad enough for opt-in services like Facebook, etc, but for analytics software to be actually monitoring your history?

Imagine how much that’s worth to advertisers?

Reply
- Jennifer Mattern August 31, 2010 at 11:41 am
  
  Just to be completely fair, they’re not directly monitoring our browsing history. It’s being monitored by 3rd party “partners” that we have some kind of relationship with, where it says they can sell or give our information to others.
  
  But there’s a big difference between most terms I reviewed while writing this post and giving information away that could be personally identifiable (most terms clearly stated they would not give personally identifiable information). The problem is that these companies assume what’s not personally identifiable to them is not personally identifiable to anyone.
  
  That simply isn’t true as I’ve demonstrated not only identifiable info about me that was displayed, but also about at least one of the few other people I checked, where knowing them as a colleague and client I was able to determine their browsing history from that of their site’s visitors. Any way they try to cut it, that equals “personally identifiable.” If I could do it in an innocent light with a colleague, there’s nothing stopping less-than-innocent competitors from doing the same to each other when we’re talking about individually-run sites and blogs, as many professional blogs are.
  
  Reply
Sean Weigold Ferguson August 31, 2010 at 10:10 pm

Compete’s referral data on your browsing behavior was almost certainly gathered through your ISP. Companies like Hitwise obtain data the same way. I believe they treat “referrers” in the same manner as well.

As consumers, we should all be cautious about the agreements we sign with our Internet service providers. Unfortunately, for many, there isn’t a lot of competition in their local market.

Reply
- Jennifer Mattern September 2, 2010 at 9:38 am
  
  Quite possibly. I’d been considering changing ISPs for a while now — only 3 in the area (Verizon, Comcast, and a smaller phone company). I use the smaller company because I have ethical issues with the other two. But after this drama I actually just decided to switch to Verizon. And I’ll likely lose Internet access for a couple of days in the process next week (not good when you run a business online). I figured at this point it’s better safe than sorry. Then again, there’s no guarantee Verizon isn’t a partner either. So the possibility remains that if you want Internet access you might have to unknowingly allow these folks to have and publish elements of your browsing history. I certainly hope that’s not the case, but we’ll see I guess.
  
  Here’s what Verizon’s privacy policy states:
  
  “Except in certain circumstances explained in our privacy policy, Verizon does not sell, license or share information that individually identifies our customers with others outside of Verizon for non-Verizon purposes without your consent. ”
  
  So if they do share this supposedly anonymous information and it turns out it can actually be used to individually identify a customer, they’re in violation of their privacy policy. And when signing up I did not see any special little opt ins for partner information sharing either, so the terms would be governed by the privacy policy that statement came from. So yeah… we’ll see if this helps.
  
  Reply
Dror Zaifman September 1, 2010 at 9:09 am

That is pretty bad of them to do that. To collect your information, store it and then use it to profit is pretty lame. Then playing games with the privacy statements just to find a way to do it.

Frankly I am surprised that Facebook has not faced any legal issues for their lack of respecting the privacy of their members.

I can understand they are offering it at no cost to the user but as a company weather it’s compete or Facebook, you have to give notice to the users of what information is collected before doing so and give users a choice to opt-in if they wish to do so not by default.

It’s not only respectful but also offers a safer community for the users.

Reply
- Jennifer Mattern September 2, 2010 at 9:29 am
  
  Quick correction — they aren’t the ones collecting the information up front. That would be ISPs and other “partners” who then give that information to Compete. I do, however, agree that it’s “pretty lame.” And the problem is that in the terms I’ve evaluated, none say they can give potentially personally identifiable information away — just the opposite. They’re just forgetting that “not personally identifiable to US” doesn’t equal “not personally identifiable.”
  
  Reply
David September 1, 2010 at 3:11 pm

I understand the platforms and how they build their data but I just ran the same site analytics reports for those two sites and got the same data as you. There are other platforms that have the data in much more detail and im sure the paid version of compete.com offers more insight…

Its just more of a coincidence that the top search terms and top referral sites are similar to you, ive never been to to most of the top destination sites listed for allfreelancewriting.com. The top destination sites for socialimplications are obviously:
sphinn because its a social book marking site and u have the sphinn button on your blog
google because most people visit this site
youtube really is anyone doing any work and not watching clips
facebook because so much time is wasted here
myfoxboston – no idea

Reply
- Jennifer Mattern September 2, 2010 at 9:27 am
  
  Yeah, the ones for SI made sense (for the most part). But it’s also a less personal blog, and not a starting page in my browser window (meaning it doesn’t show up before I visit another more personal destination). So it’s far less likely to inadvertently reveal personal browsing data. For my own sites it’s a different story, especially now that I often use Chrome and have multiple “home” sites show up in different tabs every time I open the browser. The stats in that screenshot really didn’t have anything too personal. The personal info showed up during July (meaning stats would have been from June).
  
  And while I’d still have a big problem with Compete giving this kind of information away to paying users, it’s the more limited free data that concerns me more. That’s available to absolutely anyone who runs a free search on their site, potentially putting private information in the hands of competitors who have no business knowing what other sites you visit or are affiliated with.
  
  Reply
Jennifer Mattern September 20, 2010 at 8:50 am

I just wanted to share an update on this. Compete’s rep had offered to help me get myself off of their panel right before I ran this post. I contacted him afterward and told him I’d give him whatever information he needed to make that happen. To this day, there has been no further response from Compete. So much for good faith efforts I guess.

Reply
Johnathan Hebert November 15, 2011 at 8:01 am

Just found your article… thanks for writing it. Here is an interesting paper on the details of how the ISPs collect browsing data using a product from a company called Phorm:

http://www.cl.cam.ac.uk/~rnc1/080518-phorm.pdf

In this paper, it mentions that the system is auto opt-in, manual opt-out by adding a special cookie on your machine for the webwise.com domain. This kind of thing should be illegal in my opinion.

Reply
- Jennifer Mattern November 16, 2011 at 9:17 am
  
  lol You have to love it when a paper starts out with an apology about all of the errors to come. That’s a bit scary.
  
  I’ve since changed my ISP twice (after writing this and kicking the one violating my privacy to the curb). So after being with my new one for a few months I’ll have to run some new tests and see if the problem still exists. I definitely don’t like auto-opt-in anything. I think it’s especially bad when your average user (who probably doesn’t even know what a cookie is in Web terms) is expected to add one to opt out of something. An opt out option should always be clear and openly offered, and it should be as easy as checking a box or clicking an opt-out link (just like the required unsubscribe links in emails thanks to anti-spam laws). While I can understand some things being auto-opt-in (so I won’t go so far as to say that should always be illegal), I do think it should be illegal to track information with only a mention buried in TOS for a service most people need these days without open and obvious ways to opt out.
  
  Reply