Research


15
Mar 10

Upcoming Talks

Some quick notes on upcoming talks:

This Thursday, March 18, I’ll be speaking at the University of Michigan’s School of Information.    The title of my talk is “Disclosure, Privacy, and Support: An Integrative Perspective on Social Media.”  The talk is open to the public, and is scheduled for 4PM in 1202 SI North.

In April, a bunch of SILS folks will be heading to Atlanta to attend the CHI conference.  The conference organizers asked that presenters create a short video for their talk.   We had fun with ours:

The video is from the Prelinger Archive, a great archive of public domain videos.  You can also download the pre-print of our paper.


14
Nov 09

Interview with WBZ – Boston

A little while ago, I taped an interview with WBZ News Radio out of Boston, MA regarding my new research on older users of social network sites.

LinkMP3 Download

I’m looking forward to continuing this research in the spring – we’re currently scheduling a seminar at UNC’s Institute on Aging, so I will share the date of that seminar when we get that on the books.


23
Oct 09

Upcoming Talks and Conferences

Late October/Early November is looking very busy.  I’ll be giving the following talks:

Then I’m off to ASIST (Vancover, BC) were I’ll be talking about:

  • “The Supportive Behaviors of Older Social Network Site Users” (with V. Stull and C. Thompson) at SIG-SI
  • “Social Network Sites and Information Seeking During a Life Transition” at SIG-USE

I’ll also be interviewing for faculty positions while at ASIST – send any advice or leads my way!  Thankfully, the semester quiets down after ASIST, and I can look forward to a December full of writing and college basketball.


1
Sep 09

The trouble with Internet surveys

Gary Langer, the director of polling at ABC News, shares the bad news regarding Internet surveys.

In the most extensive such analysis to date, David Yeager and Prof. Jon Krosnick compared seven non-random internet surveys with two others based instead on random or so-called probability samples. The non-probability internet surveys were less accurate, and customary adjustments did not uniformly improve them.

While the random-sample surveys were “consistently highly accurate,” the internet surveys based on self-selected or “opt-in” panels “were always less accurate, on average, than probability sample surveys, and were less consistent in their level of accuracy,” the researchers said. Further, they said, adjusting these samples to known population values had no effect on accuracy (and in one case even worsened it) as often as that process, known as weighting, improved it.

Also noteworthy:

While this paper is the first to evaluate the subject in such detail, intimations of these problems were posted in a blog item this summer by Reg Baker, COO of the research firm Market Strategies International. Estimates of smoking prevalence were similar in three probability samples, he reported, but less similar – with variation of as many as 14 points – in 17 opt-in online panels. In such panels, he said, “the results we get for any given study are highly dependent (and mostly unpredictable) on the panel we use. This is not good news.”

Yeager and Krosnick, meanwhile, provide one more eye-opener: The average highest weight for any one respondent across the opt-in online samples was 30 – one respondent, that is, standing for the equivalent of 30 in the full dataset. (And one went as high as 70.) The highest weights in the two probability samples, by contrast, were 5 and 8.

Nothing new or groundbreaking here, and yes, a little inside baseball, but relevant in the light of all of these web surveys showing that “Teens don’t tweet.”  First, convenience-sampled web surveys can’t offer standard errors, and the weighting process that produces errors is highly susceptible to inflation in areas where data are sparse.  This sparseness commonly occurs when studying the behavior of a low-response population such as young people, and is multiplied when studying an early-adopting phenomenon like Tweeting.

Langer’s blog is a worthwhile resource if you’re interested in survey methods.  And I hope to resume blogging – updating my syllabus, posting some recent papers, etc. – when I get a spare moment.

via Study Finds Trouble for Internet Surveys – The Numbers.


5
Aug 09

Teens Don’t Tweet, or, How to Read a Web Panel

In the past few months, we’ve seen a number of studies of dubious methodology make sweeping generalization about Twitter.  Examples include the Twitter gender study; another study asserted that because only one out of every five teens tweet, that teens don’t use the service (wouldn’t you like one out of every five teens to use your product?).   Nielsen joins this discussion by stating that “Teens Don’t Tweet”, based on  findings from their online panel.  They assert that “In June 2009, only 16 percent of Twitter.com website users were under the age of 25. Bear in mind persons under 25 make up nearly one quarter of the active US Internet universe, which means that Twitter.com effectively under-indexes on the youth market by 36 percent.”  Oh noes!

twitter_by_age

As these data will undoubtedly be reported breathlessly elsewhere, I thought it might be useful to step back and explore some of the issues with the methodology and conclusions.  So first, a note about the methodology.  The Nielsen NetView panel contains an impressive 250,000 users.  Metering software located on client machines records the websites visited by panel members.  A vast majority of the panel is recruited online; the panel is “calibrated” (weighted) against gold-standard sampling methods (Random Digit Dialing, etc.).

Survey weighting is a standard, fairly uncontroversial process.  It is commonly used and is thought of as preferable to census-type approaches that may systematically under-represent some populations.  However, reliable survey weighting gets tricky when the population is small.  Since teens are a notoriously hard-to-reach population, we generally see inflated standard errors around weighted teen respondents in a population survey.  Nielsen does not report standard errors, and the makeup of their panel is confidential, so therefore it is impossible to know how much error there is around the estimate of use.  If the panel is like other panels, though, there may be more error in young people than a high-response population, such as adults.  We’re very familiar with margins of error (the things you see in political polls, where error is reported as plus or minus 3 percent, etc).  An inflated error means the margin is larger, meaning that the estimate may vary by a larger amount.

This is not to put down Nielsen.  With 250,000 members, the panel likely has good coverage of young people.  Since my purpose is to use this example to critique web panels, we must point out two other issues.  First, bigger is not necessarily better if the sample is convenience driven.  Nielsen’s panel is very large, but simply because it is large doesn’t mean it is representative.  In fact, Nielsen is likely more interested in the larger size for better sparse-market coverage, as opposed to statistical reliability.  Second, the particular nature of recruitment into the main panel introduces selection bias.  If people aren’t selected randomly, then there may be characteristics of the population that covary with the variables of interest.  This is an omni-present issue with polling, but it must be noted.

So when we read a web poll of this particular nature, what are the critical questions we should be asking?  First, we should be concerned about cell size (the number of respondents) for a hard-to-reach population.  If young users are underrepresented, the standard errors on the estimates can be quite large (which may push an estimate around by +/- 10 percent).  We should also question the method of recruitment; if the majority of the panel comes in via the web, then who gets left out?  Since this poll is designed to represent online users, it is seems likely that heavy web users are participants (my guess).  But what if Twitter users actually aren’t like heavy web users?  There are a whole host of other questions we should ask regarding polls (response rate, sampling frame, etc) that generally aren’t answered in online polls.

It is important to understand the potential methodological issues when reading research.  Nielsen’s methods are standard for the industry, and they acknowledge the drawbacks and limitations.  In my opinion, the major problem isn’t the methods component, it is Nielsen’s spinning/presentation of its results.  In the Nielsen study (and the previous Participatory Media Network study), the findings focus on lack of teen use of Twitter.  However, the findings reported by Nielsen cover the following age ranges: 2-24, 25-54 and 55+.  The critical category, 2-24, covers a wide range of users – incredibly young children, adolescents, teens and adults.  The grand mean reported by Nielsen is affected by variation inside the different age categories.  Using census data, we can look at age breakdown over the ranges 2-24.  According to census, there are 80MM Americans under age 24 (0-24).  There are approximately 15-16MM Americans in the age ranges 0-4, 5-9, 10-14, 15-19, and 20-24.  Therefore, each category is pretty much weighed equally.  So lets (hypothetically) assume that no one age 0-9 uses Twitter, 5% of people age 10-14, 35% of people age 15-19, and 40% of people age 20-24 use Twitter.  To calculate the grand mean we would weight the percentages and then sum.  Such a formulation would give us 16% use for the demographic age 0-24 (0+0+.01+.07+.08).

The second problem with Nielsen’s presentation is the comparison range.  Comparing the age ranges 2-24 and 25-54 is not fair on a number of levels.  The first category can really only meaningfully cover age 13-24, while the 25-54 age range meaningfully covers 30 years.  If we weigh the estimates (16%/64%) by volume coverage (1:3 ratio), then the category volume for older users would be ~21% (I didn’t bother to weigh by census, just an estimate).  And what if we compared just teens against an adult category – we might even find that teens Twitter more than adults.  Keep in mind, with all the advantages afforded to older users (no Internet restrictions, etc) there are major differences between older users and teen/young users in their capacity to partake in online community.

My analysis is simplistic and speculative, but in certain configurations, “young people” could plausibly use Twitter at higher rates than “adults.”  I don’t have a guess regarding what is right, but my gut tells me that if Nielsen was more upfront regarding their sampling, and less misleading with their infographics, we’d have a different story.  And that story would not be as catchy and headline-grabbing as “Teens Don’t Tweet.”


4
Aug 09

Soliciting Participants: Research on Older Users of Social Networking Sites

Update: Thank you to all volunteers!  I have already filled my quota, so no more participants are necessary.  Still looking for a RA, though!

I am beginning a new research project that will look at older, recent adopters of social networking sites (the fastest-growing population on many sites). I am interested in how these users are using the sites, what motivated them to join, and how the sites benefit and affect their daily lives. The first part of the study involves in-depth interviews, for which I am currently recruiting participants. More information:

UNC-Chapel Hill researchers are conducting a study of recent adopters of social networking sites (Facebook, Myspace). We seek individuals, age 40 or older, who first joined a social networking site within the last two years. If you meet these criteria, we are interested in talking to you about why you joined the social networking site and what you do on the site.

To qualify for this research, you must be age 40 or older and have started using social networking sites within the last two years. Your participation is entirely voluntary. Individuals who wish to participate will be interviewed for one hour, and they will fill out a simple questionnaire. Participants will be compensated $10.00 for their time. Interviews can be in person, or remotely (over the phone, via Skype, etc.). To volunteer for participation, or ask any questions about the project, please email Principal Investigator Fred Stutzman at fred@fredstutzman.com. If you prefer, you may call 919-260-8508.

This research has been approved by the University of North Carolina Institutional Review Board, IRB-09-1303. Gary Marchionini, Ph.D., Cary C. Boshamer Distinguished Professor in the School of Information and Library Science, is faculty supervisor of this study.

Please feel free to pass this along to anyone who you think might be interested. If you are interested in this project, I am currently seeking a research assistant to help with transcription and analysis. This is a great job for a Master’s-level student looking for topical research experience.


23
Jul 09

Experience Social Search – and help Chirag graduate!

Fellow SILS Ph.D. Student Chirag Shah is doing some very interesting work on his Ph.D.  You can be part of it – here is the information:

My name is Chirag Shah and I am a doctoral candidate at SILS, UNC Chapel Hill. The purpose of this email is to request your participation in a research study investigating a collaborative information seeking system.

This study requires a team of two. You need to sign up in pairs. Both the participants in a team should have worked on some project before (e.g., a class assignment). You need to sign up for two sessions, which are one to two weeks apart. Your participation will take approximately one and a half hour per session. Approximately 45 pairs of participants are being enrolled for this study.

The study will involve using an experimental system, called Coagmento, while surfing the Web. Coagmento is a plug-in for Firefox browser, which provides support for Web surfing in a team. You will be paid total $25 (per person) for two sessions (thus, a team will receive $50). The best performing team will also receive two iPod Shuffles.

To participate in this study, please visit http://www.coagmento.org/study1/signup.php and submit your request. The approval of this request is subject to meeting all the criteria specified above.

Choosing or declining to participate in this study will not affect your class standing or grades at UNC-Chapel Hill. You will not be offered or receive any special consideration if you take part in this research; it is purely voluntary. This study has been approved by the UNC Behavioral IRB (IRB Study 09-1037, Approval Date 6/10/2009), and will be supervised by Prof. Gary Marchionini (march@ils.unc.edu) at SILS.