Posts Tagged: statistics


18
Apr 06

New RSS Statistics

I’m tempering my zeal for RSS with a dose of statistics. Steve Rubel of Micro Persuasion points out the most recent Ad Age Interactive Fact Pack. Looking through, it offers some insight into the demographic profile of RSS users. We’re wildly skewed male, and our demographic fits the profile of early adopter. The report neglects statistics on RSS awareness, which I feel is the key statistic for the market. In October 2005, Yahoo/Ipsos released an enlightening RSS white paper, which showed that only 12 percent of the US online population was aware of RSS, and only 4 percent of the US online population stated use of RSS. The optimist in me wants to believe these numbers are inching out, but unfortunately the Ad Age report doesn’t address this particular facet. Anyone know of any number that are more recent than Yahoo’s?


8
Jan 06

Student Life on the Facebook

Over the course of the last semester, I have been analyzing the behavior of UNC-Chapel Hill students in social network communities. In case you’re not quite sure what social network communities (SNC’s) are, they are services like MySpace, Friendster, Orkut and Facebook. I am particularly interested in Facebook, primarily because it is so heavily used on campus. In a previous study, I found that 88 percent of freshmen on the UNC campus had active Facebook accounts. As one might imagine, any service that reaches 88 percent of our freshmen is worth trying to understand, so I devised a system to sample the Facebook on one week increments. In this post (which will be long, and is available for download as a white paper – pdf), I’ll explore some of my findings, share interesting data and trends, and provide some of my personal opinion on this Facebook phenomenon.

I hope that this post will be of use to students, faculty and administrators, as well as the general public as they attempt to understand the Facebook particularly, and SNC’s more generally. As with any new phenomenon that affects our life, we have many questions and search for answers; I hope to help get a discussion rolling. In the sense that this post may help at-stake participants better understand the service and its uses, I hope that it is beneficial to all.

First, a word about my methodology. For each week, all students that self-identified as undergraduates, class of 2009, were sampled. This means their profiles were inspected, and data elements were analyzed. There is no way to “prove” that the students who self-identity as freshmen are actually freshmen, so the data is limited in this sense. Second, it should be noted that when I talk about the behavior of “freshmen”, I am referring to the behavior of freshmen in the Facebook. I will refer to my population as “freshmen”, but I make no claim of knowledge of freshmen behavior outside the Facebook context. (Update – The margin of error if these findings were to be generalized to all freshmen at the university, would be as follows: the first week of the semester is +/- .74%, and at the last week is +/- .54%)

Additionally, as with any large scale sample, there may be data imperfections due to technical difficulties encountered during the sampling process. However, I feel confident that any data imperfections would be covered under a reasonable margin of error. I must also state that while the Facebook’s terms of service prohibit the copying of data from the Facebook to a third-party, I am claiming fair use of the aggregate data that you will see here. Other than “adoption” data, which is well-known already, none of the data here deals with the Facebook, only with how students use it. It is my goal to understand how students share information in SNC’s, and the Facebook provides my vector.

To begin, we will look at adoption trends in the Facebook. One of the thing that surprised me is that on the first day of school, 3193 freshmen reportedly had an account. This was over 85% of the entire freshman class, and many had been using the Facebook for many months. As it turns out, the months of June and July represent the greatest months of Facebook account creation.


Click on image to see larger version

If we were to turn this data into a logistic S-curve, we would see a very characteristic adoption curve. To sign up with the Facebook, one needs a university email address to log in. When I compared my new account data to the dates of UNC’s orientations (held mainly during the months of June and July), I saw strong, direct correlations between orientation and account creation. Generally, the two days following orientation would represent a 200-500% percent increase of average daily account creation for the month. It became obvious that many students learned of the Facebook at orientation, most likely virally through friends, though it is also quite possible they learned of the service through semi-official means such as orientation leaders. Due to this, the students joined the site early, and were already comfortable users when school started.

Next, we will look at how use of the Facebook grew over the course of the semester. As I previously stated, on the first day of classes, 85% of freshmen had a Facebook account. Over the course of the semester, that number grew until over 94% of freshmen had a Facebook account. In the chart, we will look at how that trend, as well as examining the use of privacy in the Facebook.


Click on image to see larger version

The blue line represents total accounts, and the pink line reflects total accounts that are not private, meaning anyone with a UNC Facebook account can view that profile. As the semester passed, students protecting their profile grew from 3.2% to 4.75%. It should be noted that these total account numbers represent a revision upward from my previous blog posting; the methodology has been revised in this analysis and is correct.

While the actual number of nodes (the freshmen) in the network did not grow substantially over the course of the semester, the number of edges (friendship connections) in the network did expand remarkably. As the freshmen made friends over the course of the semester, their social network size grew from 144,319 to 373,651 connections. The average number of friends a freshman on the Facebook had on day one was 46, and at the end of the semester, he or she had 111 friends. This might give us a picture of how many friends a freshman might make the first semester of college: 65.


Click on image to see larger version

Of course, these 65 new friends can make an impact on the lives of freshmen. As the freshmen seek to stake out their identities, do they change dramatically over the course of the semester? The Facebook allows users to list their political orientation, which serves as, in my opinion, a very strong indicator of identity. One can orient an entire social experience around the philosophy of politics.

We all know that freshmen change dramatically in their first semester at college. However, I felt it might be interesting to see if they shift their political orientation in their first semester. Does college make Liberals even more Liberal? Does college make Conservatives more Liberal? These are spectrum questions, of course, and we all shift as we learn, but as it turns out, freshmen do not shift their political orientation during their first semester at college.


Click on image to see larger version

In the graph above, the political orientation of campus is presented as a moving ratio. As you can see, over the course of the semester, the number of students identifying as Liberal decreases, and the number of students identifying as Conservative increases. This must be qualified, though. In the table below, which breaks out the exact percentages, you’ll see that the movement is quite marginal. (Update – The number of freshmen who report a political orientation are a subset of all freshmen who use facebook; the margin of error at the first week of the study is +/- 1.04, and at the end of the semester is +/- .98. The margin of error if these findings were to be generalized to all freshmen at the university, would be as follows: the first week of the semester is +/- 1.28%, and at the last week is +/- 1.12%.)

Political Orientation Campus Share, First Week Campus Share, Last Week
Very Liberal 4.75% 5.0%
Liberal 33.58% 32.91%
Moderate 27.77% 26.78%
Conservative 26.46% 26.74%
Very Conservative 1.14% .99%
Libertarian 1.39% 1.83%
Apathetic/Other 4.89% 5.73%

It should also be noted that this data has severe limitations. It does not actually reflect individual changes – if two students switch political identity from opposing parties, they cancel each other out in this estimate. This chart would only catch mass trends on campus (a liberalization, etc). I plan to explore this data in more detail later in this post, and in additional, follow-on postings.

In a previous study, I analyzed how much information students shared in SNC’s (pdf). I extended this study to see how much information the students were sharing in the Facebook. Certain things, like name, school email, join and last update date are almost always shown. But what about optional identity information?


Click on image to see larger version

In this graph, we see the percentage of freshmen sharing identity elements that I felt were interesting. It amazes me how much potentially sensitive information is openly shared. For example, you can find out the birthday, hometown, sexual orientation (indicated through a field labeled “interested in”), relationship status and political orientation of 3/4 or more of all students. Of course, there is no guarantee that any of this information is truthful – but in the eyes of employers or people who don’t know you well, does that matter?

On the previous graph, you will also see Photo Albums – a handy and extremely well-done feature added by the Facebook towards the end of the semester. Nearly 45 percent of freshmen have Facebook photo albums, an adoption trend I find interesting. What was very interesting is that the Facebook allows you to tag your photos with the names of other people in the photos, and that allows your photo gallery to include not just pictures by you, but pictures of you by other people. In the following chart, the total number of persons shown in Facebook pictures is graphed. Please note, this is not a tally of all pictures, but of all pictures and tags. In just 8 weeks, the total number exploded from 9,783 to 78,413 – almost a nine-fold increase in volume, or an average of 23 pictures/tags a person!


Click on image to see larger version

As you see the trends, it becomes evident that Facebook is a very important part of a freshman’s life. To that extent, I found that nearly 50% of all freshmen had updated their account in the last week, proving the service to be extremely sticky. Extending that time period to two weeks, greater than 67% of freshman had updated their profiles in that time period. The wall, which allows students to post public messages to each other, has 202,879 public messages to freshmen – or almost 60 per individual. Without question, the Facebook is an important resource for the freshmen, and it is used extremely often.

I’ll conclude this report by indulging myself with some political demographic analysis. As we’ve seen, the Facebook allows students the ability to classify themselves by political orientation. I thought it might be neat to place the Liberal and Conservative bloc’s next to each other, to see what I could find out.

The first thing I did was compare friend network sizes. I wanted to see who had more friends – liberals or conservatives. As it turns out, freshmen who are liberal have an average of 115.4 friends, while freshmen who are conservative have an average of 117.6 friends. The conservatives, it appears, are better at making friends than liberals by a 2.2 person margin!

Finally, I looked at some of the “favorites” of liberal and conservative freshmen. Facebook allows users the ability to list their favorite things – movies, music and books. I decided to compare the right and the left, so I conducted a little one-time pulse-like analysis of the student favorites. The results are below.

Favorite Books

Rank Liberal Conservative
1 Harry Potter Bible
2 Catcher in the Rye Harry Potter
3 The Great Gatsby The Great Gatsby
4 1984 To Kill a Mockingbird
5 To Kill a Mockingbird Da Vinci Code
6 Da Vinci Code Pride and Prejudice
7 Pride and Prejudice Angels and Demons
8 Siddhartha Catcher in the Rye
9 The Giver The Notebook
10 Angels and Demons Lord of the Rings

Favorite Movies

Rank Liberal Conservative
1 Garden State The Notebook
2 Fight Club Anchorman
3 Donnie Darko Wedding Crashers
4 Anchorman Old School
5 Office Space Gladiator
6 The Notebook Dumb and Dumber
7 Wedding Crashers Finding Nemo
8 Love Actually Dirty Dancing
9 Pulp Fiction Zoolander
10 Eternal Sunshine… How to Lose a Guy…

Favorite Music

Rank Liberal Conservative
1 Coldplay Jack Johnson
2 Jack Johnson Coldplay
3 The Beatles Rascal Flatts
4 The Killers Kenny Chesney
5 Dave Matthews The Killers
6 Bob Marley Oar
7 The Shins John Mayer
8 Modest Mouse James Taylor
9 Radiohead Switchfoot
10 Green Day Tim McGraw

Take from these what you will – the inbound data is not especially clean, and while I made every effort (and they look right, for the most part), there’s no guarantee these rankings are anywhere near perfect. What struck me about this data is that, for the most part, politics may divide us bitterly – but we actually share more in common than we might realize.

I’ll conclude this report by reminding the reader that this data represents only a very small demographic – one class of freshmen at one university in the Southeastern United States. This is not meant to be a representative study of anything more than the exact population it studied, though I wouldn’t be surprised if the use trends are similar across many, many campuses. It is important to note that this is not a study of the Facebook per se, but how students use social network services. It does not represent the Facebook as a part, or a whole. This is primarily a series of snapshots in time, showing how our freshmen use this extraordinarily successful service; hopefully from it the reader will take away a broader understanding.

Fred Stutzman is the author of this report and a Ph.D. student at the School of Information and Library Science at UNC-Chapel Hill. His research interests involve social software and networks, personal identity management, information retrieval and knowledge discovery. He can be contacted at fred@metalab.unc.edu with questions or comments. This report is available for download as a white paper (pdf). Thanks to John Joseph Bachir, Terrell Russell, and Gary Marchionini for assistance proofing, editing and refining this document.


6
Dec 05

Clickthrough research (Deconstructing the Facebook Series)

The post below is a little cheezy – and a little out of the norm for my blog. The reason it appeared is because it was the target of an announcement ad placed in UNC’s Facebook. Don’t read too much into this, what I really wanted to do is get some quantitative data on clickthrougs, because ibiblio may start posting job advertisments. Before we do a big ad buy, I wanted to see how well the ads pay off.

The Facebook at UNC promises 54k impressions for 17$ (for members, 35$ for non-members). I ran my ad for a day, and the general trend is that my traffic doubled (however, seeing as my traffic is next to nothing, its not saying that much). Coincidentally, Blogger seems to be suffering random outages (6PM-11PM EST yesterday, 2AM-5AM today), so I couldnt have picked a worse 24 hour period to buy an ad.

For impressions, you can’t beat the price, especially if you’re a member. Clickthroughs are paltry, but there are many variables I didn’t control, including crappy ad copy. Nevertheless, pretty interesting. If anyone else runs announcments, let me know how it works out for you.


2
Oct 05

The Freshman Facebook Zeitgeist

One of the interesting things I’ve been doing with my Facebook studies (see here, here, and here for more info) is collecting data on the Freshman class over time. I believe that the Freshman can tell us a lot about technology and adoption trends, and from a broader perspective, what social and political trends may be emerging on campus. To both extents, it’s fairly easy to pull out some simple metrics on these from my collected data. I’ll present a little of that data here.

First, who uses the Facebook

Basing my data analysis on these UNC-provided figures, I can say with relative confidence that nearly every Freshman on campus uses TheFacebook. On day 1 of class, I was able to discover 3095 Freshman profiles (!!!). 5 weeks later, I was able to discover 3292 Freshman profiles. 88 percent of Freshmen have an active profile on the Facebook. Lets look at these trends over time:


Click on any graph for a larger version

This graph shows total Freshman profiles found, and the lower line in the graph indicates the 2005 Freshman profiles with no “Privacy” flags set. I’ve found that a little more than 1% of Freshmen use privacy flags. Interestingly, privacy use has declined over time, at a constant rate (Week 1: 3% privacy use; Week 3: 1.5% privacy use; Week 5: 1% privacy use). This feeds into the notion that students that do not share their profiles online are not “participating” as normal social actors.

Interesting in this data is the “Week 4 depression”, which may represent students having buyers remorse about living so publicly. This is just a guess, though.

How often are Freshmen using the facebook?

One of the questions posed to me during a recent talk was “How often are students using the Facebook”, which I roughly translate to “How often are students logging in to the system, etc”. Facebook does not provide “last access” data, though it does provide “last update” information, indicating the last time the user has updated their profile. Accepting that “updating” one’s profile places a slightly higher bar on system use than simply “logging in”, I took a look at how often students are updating their profile. The results amazed me.

Using data collected on September 27, I analyzed last update; as you can see, 2871 profiles were updated in September, with 1819 profiles updated in the last week. I was really blown away by this, so I broke out the September data by day.

We can clearly see a trend that profiles are being updated most frequently in the days preceding the data collection. Assuming that “last update” stands in as a good proxy for system use, I believe we can safely say that students are using TheFacebook extremely frequently.

Friend network expansion

One of the interesting metrics we can pull out of the data is average friend network size; in doing so, we watch the obvious and interesting aggregation of a social network on campus.

The average moves up at a fairly regular interval. For the network geeks out there, here’s data on the Freshman social network size:

Week 1: 3095 nodes, 144,260 edges
Week 2: 3188 nodes, 167,783 edges
Week 3: 3241 nodes, 198,668 edges
Week 4: 3217 nodes, 218,534 edges
Week 5: 3292 nodes, 245,883 edges

Freshmen political orientation

2423 Freshmen report a political orientation in the Facebook; this is 74%, or about 3 out of ever 4 students. Reported here are some well thought out reasons why to take political orientation data in the Facebook with a grain of salt. And to that extent, I agree completely. The notion of labeling in the Facebook may be completely arbitrary; a good amount of the data in the system represents in-jokes and the like. The students use codes in the system to communicate and display in-network group strength and identity. Stepping back from that, though, I still think its interesting to see the data. And of course, it has surprised me.

While Liberal students again outnumber Conservative students, there is suprising parity between the two factions (assuming, of course, that UNC is a liberal institution). Is this what the post-9/11 student body looks like? When political orientation becomes a face-value aspect of identity, how does it force factionization in friend-making behaviors? If we all purport to know each other’s politics, is it not hard to imagine freshmen making decisions based on this highly characteristic identity element? Please see this post for more information on this data, the pooling that goes on, etc.

Now, for some fun

First, I present what I’m calling the Happy Index. This is Freshmen relationship identification over time. I’ve decided to include all fields available in the Facebook, though the graph can more effectively be segregated along Single/Not-Single lines. It is interesting to see how Married and Open Relationship increase over time as the in-joke nature of the claim moves through the network. Since the network size changes each week, this is presented as a ratio of the total network.

Finally, I analyzed what sort of websites students are identifying as their personal websites. As we might expect, the majority of personal websites claimed by students are of the photo, blog, or profile hosting nature.

Interestingly, only six unc.edu sites were returned, with not one being a unc.edu/~ personal site. Not one freshman reports using his or her UNC provided webspace. This is alarming, but not surprising.

For questions about this data, please contact Fred Stutzman