Verified Document

Extracting Information Sentiment From Blogs Research Proposal

9). Moreover, just as content analysis of other written and symbolic forms has provided new insights that might have otherwise gone unnoticed, the analysis of blog content may reveal some unexpected findings concerning hot topics and significant social trends that are shaping the users of this information. For example, a data infrastructure engineering team intern working at Facebook recently generated an eerily accurate global map based on Facebook friendship links. According to the developer, "I was interested in seeing how geography and political borders affected where people lived relative to their friends. I wanted a visualization that would show which cities had a lot of friendships between them" (Butler, 2010, para. 3). While Butler had some vague ideas about the types of clusters that would populate the map, he would surprised by the results in the way they mirrored the population densities of the world so accurately, with some noticeable absences (Cuba, North Korea, large parts of Africa and South America, the western half of the United States, etc.).

Based on his content analysis of 10 million Facebook friendship links, Butler plotted the location of each individual's latitude and longitude lines and generated connecting lines between each friendship pair, with higher levels of paired links being shown as brighter lines in the map in Figure 1 below.

Figure 1. Butler's Facebook friendship links map: dark areas on the map represent where Facebook use is less prevalent

The map's striking similarity to geopolitical maps was also noted by Butler. According to Butler, "Not only were continents visible, certain international borders were apparent as well. What really struck me, though, was knowing that the lines didn't represent coasts or rivers or political borders, but real human relationships. Each line might represent a friendship made while travelling, a family member abroad, or an old college friend pulled away by the various forces of life" (2010, para. 4).

This analytical approach is also used by Finin and his associates for sentiment-identification purposes. According to these authorities, "Our approach uses the link structure of a blog graph to associate sentiments with the links connecting blogs. Such links are manifested as a URL that blogger a uses in his blog post to refer to blogger B's post. We call this sentiment link polarity, and the sign and magnitude of this value is based on the sentiment of text surrounding the link" (p. 78). Clearly, this type of online data can be used to reveal some valuable new information in ways that have never been possible in the past.

Such graphic representations are just some of the attributes of written communication that content analysis can provide. Because blogs (and this term can be expanded to include the idle chit-chat, back-and-forth, thoughts, ramblings, viewpoints and other posts shared on Facebook and other social networking fora ever day) represent an incredibly accessible way to reach other people, and people who know those people and so forth in an ever-widening network of social interaction. This accessibility may be fundamentally more significant in the long-term than other important innovations in communication such as the telephone. In this regard, a growing number of observers cite the increasing importance of the Internet in the business world and suggest that blogging has become the platform of choice for consumers and their favorite companies (Pikas, 2005). For instance, Bielski emphasizes that not all bloggers are created equally, at least with respect to their online posts. "Certainly, there is hype surrounding Web 2.0 with its dual message of the internet as application platform and internet as the ultimate participatory forum. and, blogging is viewed as a staple of this new internet" (2007, p. 8).

Identifying recurring themes and emerging trends in this type of dynamic environment is a challenging enterprise to be sure. As Bielski points out, "Yet out of the glare, the reality of user-generated content is a mixed bag. The writing can be freeform, to put it politely. Many blogs look horrible," she notes and adds that many are "boring, or 'safe' might be better adjectives" (2007, p. 8). Furthermore, this "mixed bag" of blog content makes identifying posts that may communicate certain sentiments even more challenging. According to Bielski, "Corporate creators don't make these blogs easy to subscribe to, search through, or otherwise interact with" (2007, p. 8).

Fortunately, Google provides a series of URL templates that can be "invoked via command M-x emacspeak-url-template-fetch normally bound to control e u . This command prompts for the name of the template, and completion is available via Emacs' minibuffer completion" (Google Blog Search, 2010, para. 2). The steps involved in conducting this analysis for each URL template are as follows:

A. Prompt for the relevant information.

B. Fetch the resulting URL using an appropriate fetcher.

Set up the resulting resource with appropriate customizations.
Although "unblog-related," the template application used by Google Blog Search developers provides a useful example of how this procedure operates. According to Google Blog Search, "As an example, the URL templates that enable access to NPR media streams prompt for a program id and date, and automatically launch the realmedia player after fetching the resource" (2010, para. 3). As to their online application, the developers at Google Blog Search describe their efforts thusly: "Blog Search is Google search technology focused on blogs. Google is a strong believer in the self-publishing phenomenon represented by blogging, and we hope Blog Search will help our users to explore the blogging universe more effectively, and perhaps inspire many to join the revolution themselves" (2010, para. 2). As to the expected blog content that will be sentiment related, the developers make it clear their hosting ranges the entire human experience:

Whether you're looking for Harry Potter reviews, political commentary, summer salad recipes or anything else, Blog Search enables you to find out what people are saying on any subject of your choice. Your results include all blogs, not just those published through Blogger; our blog index is continually updated, so you'll always get the most accurate and up-to-date results; and you can search not just for blogs written in English, but in French, Italian, German, Spanish, Korean, Brazilian Portuguese, Dutch, Russian, Japanese, Swedish, Malay, Polish, Thai, Indonesian, Tagalog, Turkish, Vietnamese and other languages as well (Google Blog Search, 2010, para. 3).

Some of the other key features that make Google Blog Search useful for the purposes of the proposed study include the following:

A. The links allow user to browse Google Blog Search results by topic. For example, clicking the Technology link shows top stories in the tech world.

B. The goal of Blog Search is to include every blog that publishes a site feed (either RSS or Atom). It is not restricted to Blogger blogs, or blogs from any other service.

C. Google Blog Search uses a set of algorithms to try to determine the most popular stories in the blogosphere. The applications takes into account factors such as a blog's title and content, as well as its popularity throughout the rest of the blogging community. The results are displayed based on groups of posts that are closely related..

An informal blog search using Google's "search blogs" feature provides the following raw sentiment-related search results:

Table 1

Blog Search Results of Sentiment-Related Terms (as of December 20, 2010)

Search Term

Number of Matches

Love

467,098,607

Hate

67,059,281

Awesome

79,550,156

Terrible

17,692,083

Angry

24,621,192

Like

821,870,100

Dislike

6,399,023

Enjoy

152,132,318

Clearly, there is a great deal of sentiment being expressed in blogs, but without knowing the specific context in which these sentiment-related terms are used, though, it is impossible to discern their true meanings. For instance, some bloggers might enthuse that they "just love the pasta at Joe's Spaghetti House," while others might state they "love the president's economic policies." Likewise, other bloggers might "hate the weather" while others "hate the president's economic policies." Given the enormous response to the search term "like," it is clear that some bloggers might "like Ike" while others use the term as a comparison as in, "Eating at this restaurant is like a trip to the dentist's office." The context of the sentiment-related posts will therefore require comparison to a corpus of various sentiments used in common practice to identify positive from negative sentiments (Ojala, 2009). For example, the word "like" or "love" when used immediately with or adjacent to descriptors such as "movie" or "restaurant" could be categorized as a review, while these words used with descriptors such as personal nouns might indicate a romantic relationship. This corpus would be fine-tuned as the learning process proceeded through additional permutations of the supporting algorithms.

The results of a study by Manning (2009) that sought to identify effective ways to garner sentiment-related data from online reviews provides some useful insights into what steps are involved in the blog-searching process. According to Manning, "A large and growing body of user-generated reviews is available on the Internet, from product reviews at sites like Amazon.com to restaurant reviews at sites like Yelp.com. For users making a purchasing or dining decision, the opinions of others can be an important factor" (p. 1). The need for a method by which blog posts can be…

Sources used in this document:
References

Bichard, S.L. (2006). Building blogs: a multi-dimensional analysis of the distribution of frames on the 2004 presidential candidate Web sites. Journalism and Mass Communication

Quarterly, 83, 329-333.

Bielski, L. (2007). Got blogs? Not exactly a banking staple, a few pioneers have embraced this 'new media.' ABA Banking Journal, 99(5), 7-9.

Brynko, B. (2007, June). Northern Light's MI Analyst: New visions in marketing research.
Butler, P. (2010, December 10). Facebook friendship map visualizes connections around the world. Huffington Post. Retrieved from http://www.huffingtonpost.com/2010/12/14/facebook-friendship-map_n_796448.html.
Google Blog Search. (2010). Google. Retrieved from http://emacspeak.sourceforge.net/info / html/URL-Templates.html.
Department of Computer Science. Retrieved from http://nlp.stanford.edu/courses / cs224n/2009/fp/14.pdf.
Cite this Document:
Copy Bibliography Citation

Related Documents

Net Neutrality Ensures the General
Words: 625 Length: 2 Document Type: Essay

Another problem with data discrimination is that search engines like Google might not yield the best information. It is one thing for Google to allow for advertisements in a separate section from search results. It is quite another for Google to only yield search results for paying customers. Some ISPs claim that the consumer would benefit from value-added services to make the Internet faster or more secure. Yet the principle

Net Neutrality Network Neutrality, Also
Words: 1631 Length: 5 Document Type: Reaction Paper

The blessings of the free market in terms of competition, level playing field, and end user benefit can only continue if the Internet remains neutral across all networks. In conclusion, Wu's arguments are much more convincing than those by Yoo. Wu holds that Network Neutrality is essential for the benefits of its free market platform to continue, especially in the light of end user benefit. Innovation and competition can only

Net Neutrality
Words: 1920 Length: 6 Document Type: Essay

Net Neutrality: Benefits, Drawbacks, Issues and Concerns The Internet has been such an immense fixture in the lives of most Americans that it is impossible to imagine life without it. The Internet has become an invaluable tool to virtually everyone, and most people can’t imagine functioning without an open, free Internet that is available to everyone. In many ways, the Internet is a tremendous foundational pillar of society and of democracy:

Net Neutrality Essay
Words: 2502 Length: Document Type: Essays

In this essay about net neutrality, we provide an overview of what net neutrality is and why it is a current political issue.  The essay will define net neutrality.  Furthermore, it will describe the pros and cons of net neutrality, including reasons that net neutrality is beneficial and ways that it could be detrimental. The essay will discuss the current legal status of net neutrality, as well as the potential future

Effect of Consumers and Net Neutrality: Comcast-Netflix Deal
Words: 426 Length: 2 Document Type: Essay

Netflix-Comcast deal has been applauded and criticized in equal measure since its coming into being in February this year. Under the deal, Comcast (an ISP) will connect directly to Netflix's (a content provider) servers, essentially eliminating content delivery networks that often act as middlemen, and consequently, ensuring that Netflix's traffic gets minimum disruption in the broadband network (Woollacott, 2014). So, what exactly does this mean for Comcast's consumers and consumers

The Debate on Net Neutrality
Words: 1497 Length: 5 Document Type: Case Study

Net Neutrality: The Battle Rages onThe Net Neutrality DebateFrom the onset, it would be prudent to note that net neutrality, as Laudon and Laudon (2020) point out, could simply be defined as “the idea that Internet service providers must allow customers equal access to content and applications, regardless of the source or nature of the content” (265). This is more or less the same meaning that Mapua (2016) assigns to

Sign Up for Unlimited Study Help

Our semester plans gives you unlimited, unrestricted access to our entire library of resources —writing tools, guides, example essays, tutorials, class notes, and more.

Get Started Now