August 2008 Archives

Losing my Cookies

| No Comments | No TrackBacks

I think there are few people as paranoid as I am about which Web cookies are being placed in my browser. On most of my browsers, I have it set to accept only cookies from the domain I'm connecting to and allow me to review all cookies before I accept them. There are problems with this. To quote from Wikipedia's article on HTTP cookies:

Cookies have some important implications on the privacy and anonymity of Web users. While cookies are only sent to the server setting them or one in the same Internet domain, a Web page may contain images or other components stored on servers in other domains. Cookies that are set during retrieval of these components are called third-party cookies.

[...]

Advertising companies use third-party cookies to track a user across multiple sites. In particular, an advertising company can track a user across all pages where it has placed advertising images or web bugs. Knowledge of the pages visited by a user allows the advertisement company to target advertisement to the user's presumed preferences.

This is a nice synopsis. According to this explanation and conventional thinking, you should just not allow third-party cookies and your privacy will be intact (or at least better protected).

Enter Google Analytics

Google Analytics (GA) is a Web tracking software which Google offers free of charge to customers. Again, Wikipedia has a pretty good article about how it works. As a GA user, one adds a little piece of "hidden" Javascript on a Web page and as people visit your Web site, cookies are placed in the visitor's browser to keep track of them as they move from page to page. GA keeps track of timestamps, IP addresses, referrers, browser type, etc. A GA customer gets access to a dashboard which tells him useful statistics about his Web site. GA has the ability to drill down to specific pages, do time analysis, do geographic distributions, and generate a variety of reports. Don't get me wrong, the "good" part of Google Analytics is the ability for marketers, and those without access to traditional Web logs (or a good analysis program) to get quality Web analytic data about their sites.

Let's get to the "bad" part. In fact, it's not only bad, it's insidious. I'll explain. Since GA cookies come from the "originating" Web site rather than from Google, they are NOT third-party cookies, but they behave much like them. They allow the local GA user to track his Web site, but in order to make the report, that information also goes to Google. As more and more sites use GA, as users use Google Apps (and therefore identify themselves and their current location on the Internet to Google), it becomes possible for one company to track the nearly complete on-line behavior of a typical Internet user. That company is of course Google, by virtue of making life easier for their GA customers. It helps them better target on-line ads and AdWords to me and my typical use of the Web. Google Analytics is both brilliant and scary at the same time.

The way I battle GA is to deny GA Web cookies while still allowing cookies which the originating site requires to do business with me. This is sometime problematic. The previous version of GA required me to block at least 4 cookies per page. Sometimes pages included multiple elements that each had their own GA javascript in them. Sites like the RealAge site which Professor Chris Long blogged about require so many cookies for so many elements that it is literally impossible to take the RealAge quiz without allowing all cookies (including many, many GA cookies). I just gave up...

The new version of Google Analytics tries to place 11 cookies/per instance of GA javascript in a Web page. This is getting ridiculous. Why not just "deny all" cookies for these sites? That, of course, works for some sites (e.g. The College of Education, Penn State Live, and various blogs by my ITS colleagues), but there are many sites using them (e.g. The Penn State Office of Human Resources) where I do need to interact with the site to do my job (as supervisor, as employer, etc.). Right now, most of these sites only use the 4 cookie version of GA. What happens when they start using the 11 cookie version? Will I persist or just give up?

As an exercise in my growing pain, browse the sites you normally would first thing in the morning, but do them with the "ask me every time" flag set on "Accept Cookies?" If you really want to see something, delete all the stored cookies in your browser and all of the sites you have told your browser to "always allow/block cookies" from. (Hint: the ones that look like "_utm"something are GA cookies -- look at your cookie cache before your delete it, do you have any?)

What can be done?

  • One of the things I'm thinking about is a Firefox plug-in to intercept and deny all GA type cookies. I know there are ones which put incorrect information in the GA cookies and yield an incorrect report, but I think "poisoning" the information is wrong.
  • You as a GA customer can stop using it.
  • Penn State could issue a policy against the use of GA on Penn State Web sites as CIC peer The University of Indiana has.

The last suggestion is worth thinking about, but I think we can't do it unless we have an alternative -- a Penn State Web statistics/analytics service which any of us can use (regardless of "official" nature of the Web site). As far I know, Indiana does not provide a centrally supported alternative.

Should I learn to stop worrying and love Google Analytics, or should I continue my Pyrrhic battle to keep my browser GA cookie free?

It's enough to make you lose your cookies...

Lympic Fever

| No Comments | No TrackBacks

I feel my quadrennial bout of (Summer) 'Lympic Fever coming on. I absolutely love the Olympic Games and I'm looking forward to the spectacle of China hosting these games. I'm also preparing myself for the quadrennial disappointment of bad TV coverage. I know that NBC using their seven or so channels (CNBC, MSNBC, USA Network, NBCU, Telemundo, Oxygen, and of course good ol' NBC) claims that this will be the most coverage of any Olympic Games. I have no doubt that that's true, but I do have a few pieces of advice:

  • Show the darned sports! Please, I know everyone in the Olympics, particularly the U.S. athletes, has a hard luck story of how she almost didn't live to see these Olympics... show the darned sports! Put these stories on your Web site and refer people there.

  • Show the guy from Lichtenstein who's two minutes slower than everyone else! Too often during a live event we see profiles while the live action is going on. We only are shown the top few people in the event and of course the Americans. Show as many competitors as possible... live. When you see the top competitors and can compare them to the not so elite, you really get the sense of how exceptional the medalists are.

  • Show it LIVE! There's a 180 degree phase shift between Beijing and the Eastern Time Zone. That means that prime time in the US is from 7am to 11am in Beijing and conversely, the evening events take place in the morning in the U.S. I don't care; I'll stay up. My frustration will almost assuredly begin tomorrow morning -- the Opening Ceremonies. It starts at 7am ET. Show it in its entirety during the Today Show; don't wait until prime time. People will watch it again in prime time. When I was growing up in Western NY, we had the advantage of getting the CBC coverage from Toronto. There's an outfit that knows how to cover the Olymipcs. I checked their schedule and sure enough, they are showing the ceremonies live tomorrow morning.

    Maybe I could go home for two weeks...

  • Hypocrisy

    | No Comments | No TrackBacks

    As some of you found out yesterday, our friends in the Microsoft e-mail department "blacklisted" Penn State e-mail. This that e-mail to recipients with .live.com or .hotmail.com addresses was blocked by Microsoft's incoming e-mail servers. The stated reason was too much SPAM originating from .psu.edu to those e-mail addresses.

    Here's the irony. Most of that SPAM is due to "phished" Penn State e-mail accounts, where our account holders are asked to verify their userid and password and reply with them in e-mail (By the way... WE WILL NEVER ASK YOU TO DO THIS!). In many cases, the replies go to what I'd call loosely vetted, ephemeral addresses. By "loosely-vetted" I mean identities which may be obtained quite easily without much proof of identity (e.g. login or sign-up here...). Examples of these types of addresses are GMail, live.com, Yahoo!, Hotmail accounts. By "ephemeral" I mean that once the "phish" replies start coming in, the provider is notified and the accounts are terminated. Of course, our "phish"ers ("phishermen?") obtain another account tomorrow from these same providers. In the meantime they have access to a whole new set of IDs and passwords to SPAM and carry out further "phishing" schemes.

    From what I've seen, GMail doesn't do this blocking because they know they are part of the problem. The other loosely-vetted, ephemeral e-mail providers don't seem to see it that way. A group that I belong to, the Higher Ed e-mail Administrators list, has a project to keep track of these addresses and leverage the "Wisdom of the Crowds." I think it's a good effort and those who subscribe to these lists often save themselves a world of hurt by heading the scam off early.

    If you look at phishing attempts at all (I get 20-40 per day), take a survey of where the "Reply-to:" address is pointed. If it's hotmail or live.com, I think I know where I'm going to send those from now on.

    Of course, I might not be able to get through...

    NASA at 50

    | No Comments | No TrackBacks

    This week was NASA's 50th birthday. Actually it's been 50 years since President Eisenhower approved the National Aeronautics and Space Act of 1958, but NASA didn't start operations until October 1, 1958. I'm sure there will be much bigger celebrations then.

    This anniversary combined with Randy Pausch's death makes me consider what inspired me as a kid, particularly what inspired me to pursue the current course of my life. NASA as you can probably guess was a huge part of it. As a kid, I distinctly remember Neil Armstrong stepping on the moon as well as the Apollo 13 mission (although I must admit that my memories are probably augmented by the movie). As a scientist, I worked on several NASA grants for astronomy and Mission to Planet Earth. I'm still very inspired by NASA images and missions.

    If I take a look back at the last 50 years (only some of which I've been alive for -- OK most), I think NASA's unmanned missions have been a spectacular success particularly for astronomy and Earth science. The manned missions have fallen on hard times in the last few decades due in part to two shuttle disasters, but also due to budget cuts. The budget cuts are unfortunate because I believe that NASA in the 60's and 70's led to a whole generation of scientists and engineers. That's good for the economy in the long run. Like the Olympics, NASA also inspired national pride -- we might not have put the first human in space, but we "won" the race to the moon. Senator and former astronaut John Glenn used to say that for every one dollar spent on the Apollo program, seven dollars were returned to the U.S. economy. Many of the advances in miniaturization, electronics, computing, and yes even food science (can you still buy Tang?) were driven by NASA and NASA contractors.

    I only hope that the new Congress and the new Administration will fund NASA at levels which will continue to inspire future generations of Americans and particularly American engineers and scientists.

    About this Archive

    This page is an archive of entries from August 2008 listed from newest to oldest.

    July 2008 is the previous archive.

    September 2008 is the next archive.

    Find recent content on the main index or look in the archives to find all content.

    Categories

    Pages

    Subscribe

    Powered by Movable Type 4.21-en