<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>What's the Scoop, Wei-Hao?</title>
	<atom:link href="http://whatsthescoopweihao.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://whatsthescoopweihao.wordpress.com</link>
	<description></description>
	<lastBuildDate>Fri, 25 Jul 2008 13:53:02 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='whatsthescoopweihao.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>What's the Scoop, Wei-Hao?</title>
		<link>http://whatsthescoopweihao.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://whatsthescoopweihao.wordpress.com/osd.xml" title="What&#039;s the Scoop, Wei-Hao?" />
	<atom:link rel='hub' href='http://whatsthescoopweihao.wordpress.com/?pushpress=hub'/>
		<item>
		<title>Ideology-O-Meter</title>
		<link>http://whatsthescoopweihao.wordpress.com/2008/07/25/ideology-o-meter/</link>
		<comments>http://whatsthescoopweihao.wordpress.com/2008/07/25/ideology-o-meter/#comments</comments>
		<pubDate>Fri, 25 Jul 2008 13:53:02 +0000</pubDate>
		<dc:creator>Max</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://whatsthescoopweihao.wordpress.com/?p=25</guid>
		<description><![CDATA[One of the applications of automatic ideology analysis in my PhD thesis work is to predict the ideological perspective from which an article is written.  I made a web-based demo, Ideology-O-Meter, that takes an input text on the Israeli-Palestinian conflict and analyzes how likely it is that the text was written from the Israeli or the Palestinian perspective. [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=whatsthescoopweihao.wordpress.com&amp;blog=3985183&amp;post=25&amp;subd=whatsthescoopweihao&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>One of the applications of automatic ideology analysis in my PhD thesis work is to predict the ideological perspective from which an article is written.  I made a web-based demo, <a title="Ideology-O-Meter" href="http://weihaolinatcmu.googlepages.com/ideology-o-meter">Ideology-O-Meter</a>, that takes an input text on the Israeli-Palestinian conflict and analyzes how likely it is that the text was written from the Israeli or the Palestinian perspective.</p>
<div id="attachment_26" class="wp-caption alignnone" style="width: 310px"><a href="http://whatsthescoopweihao.files.wordpress.com/2008/07/ideology-o-meter.png"><img class="size-medium wp-image-26" src="http://whatsthescoopweihao.files.wordpress.com/2008/07/picture.png?w=300&#038;h=251" alt="Ideology-O-Meter" width="300" height="251" /></a><p class="wp-caption-text">Ideology-O-Meter</p></div>
<p>There are three panels in the <a title="Ideology-O-Meter" href="http://weihaolinatcmu.googlepages.com/ideology-o-meter">Ideology-O-Meter</a> demo.  In the left panel, you can type any text whose ideological perspective on the Israeli-Palestinian conflict you would like to identify.  I have prepared example texts written by real Israeli and Palestinian authors on <a title="bitterlemons" href="http://www.bitterlemons.org/">bitterlemons.org</a>.  You can use these examples by clicking one of the two buttons above the text box.  After you press the Identify button at the bottom, the input text will be sent to the automatic ideology analysis program running in the background.  The program will parse the text and infer the likelihood of expressing ideological beliefs using the <a title="Joint Topic and Perspective Model" href="http://whatsthescoopweihao.wordpress.com/2008/06/26/joint-topic-and-perspective-model/">Joint Topic and Perspective Model</a>.</p>
<p>The results are shown in the middle and right panels.  The middle panel is the Ideology-O-Meter, and the position of the arrow indicates how strongly the input text conveys one of the two ideological perspectives.  The more strongly a text expresses the Israeli view, the further the arrow moves to the right.  Similarly, the more strongly a text expresses the Palestinian view, the further the arrow moves to the left.  In the above example, the text appears to be written very much from the Palestinian perspective.</p>
<p>The third panel lists the 10 most frequent words in the input text and their frequencies (in dark yellow).  The longer the bar, the more frequently the word appears in the input text.  The light yellow bar is the expected frequency in articles written from the Palestinian perspective, which the Joint Topic and Perspective Model learns from the <a href="http://weihaolinatcmu.googlepages.com/data">bitterlemons corpus</a>.  The closer the two bars are in proportion, the more likely it is that the input text was written from the Palestinian perspective.</p>
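<p>The comparison in the third panel can be sketched in a few lines of Python. This is only an illustration, not the demo's code: the tokenizer, the function names, and the <code>expected</code> frequencies are made-up assumptions, and closeness is measured here with a simple total variation distance, which may differ from what the demo does.</p>

```python
import re
from collections import Counter


def top_words(text, k=10):
    """Return the k most frequent words with their relative frequencies."""
    # Hypothetical tokenizer: lowercase alphabetic runs only.
    words = re.findall(r"[a-z]+", text.lower())
    total = len(words)
    return [(w, c / total) for w, c in Counter(words).most_common(k)]


def proportion_gap(observed, expected):
    """Total variation distance between observed and expected proportions.

    Both inputs are renormalized over the observed words, so 0.0 means the
    two sets of bars have identical proportions and 1.0 means no overlap.
    """
    words = [w for w, _ in observed]
    obs_total = sum(f for _, f in observed)
    exp_total = sum(expected.get(w, 0.0) for w in words)
    if obs_total == 0 or exp_total == 0:
        return 1.0
    return 0.5 * sum(
        abs(f / obs_total - expected.get(w, 0.0) / exp_total)
        for w, f in observed
    )


# Made-up numbers for illustration; not taken from the bitterlemons corpus.
expected = {"occupation": 0.03, "israeli": 0.02, "palestinian": 0.02}
text = "occupation occupation occupation israeli israeli palestinian palestinian"
observed = top_words(text)
gap = proportion_gap(observed, expected)
```

<p>In this toy input the observed and expected bars have the same proportions, so the gap is essentially zero; a text dominated by other words would push it toward one.</p>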
<p>You can read more about the statistical model behind the scenes in our upcoming <a href="http://weihaolinatcmu.googlepages.com/lin08jtp.pdf">ECML paper</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://whatsthescoopweihao.wordpress.com/2008/07/25/ideology-o-meter/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">Max</media:title>
		</media:content>

		<media:content url="http://whatsthescoopweihao.files.wordpress.com/2008/07/picture.png?w=300" medium="image">
			<media:title type="html">Ideology-O-Meter</media:title>
		</media:content>
	</item>
		<item>
		<title>Joint Topic and Perspective Model</title>
		<link>http://whatsthescoopweihao.wordpress.com/2008/06/26/joint-topic-and-perspective-model/</link>
		<comments>http://whatsthescoopweihao.wordpress.com/2008/06/26/joint-topic-and-perspective-model/#comments</comments>
		<pubDate>Thu, 26 Jun 2008 18:40:49 +0000</pubDate>
		<dc:creator>Max</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://whatsthescoopweihao.wordpress.com/?p=9</guid>
		<description><![CDATA[As part of my PhD thesis, I have been working on developing statistical models for ideological text and video. By ideology I mean &#8220;a set of beliefs commonly shared by a group of people.&#8221; For example, &#8220;pro-life&#8221; and &#8220;pro-choice&#8221; are two main ideologies on the abortion issue. Automatically analyzing ideological text has been considered almost [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=whatsthescoopweihao.wordpress.com&amp;blog=3985183&amp;post=9&amp;subd=whatsthescoopweihao&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>As part of my PhD thesis, I have been working on developing statistical models for ideological text and video.  By ideology I mean &#8220;a set of beliefs commonly shared by a group of people.&#8221; For example, &#8220;pro-life&#8221; and &#8220;pro-choice&#8221; are two main ideologies on the abortion issue.</p>
<p>Automatically analyzing ideological text has been considered almost impossible.  Abelson, a pioneer in computer modeling of ideological beliefs who first developed a computer simulation of the conservative beliefs of Barry Goldwater, expressed a very pessimistic view on <em>automatically</em> analyzing ideological text in the 1960s:</p>
<blockquote><p>The simulation of the belief systems of other individuals with very different views is also being contemplated, but this step cannot be undertaken lightly since the paraphrasing procedure is extremely difficult.  One might suppose that fully automatic content analysis methods could be applied to the writings and speeches of public figures, but there is an annoying technical problem which renders this possibility a vain hope.</p></blockquote>
<p>I deeply admire Abelson&#8217;s vision of computer modeling of ideological beliefs.  Such a computer system would enable news aggregation web sites such as Google News to organize news and blogs by ideological perspective rather than simply presenting a huge cluster of news stories.  Such a system could also identify highly biased news articles and raise awareness of the biases of individual newspapers and television broadcasters.</p>
<p>However, I do not subscribe to his view on automatic analysis of ideological text.  I have observed unique emphatic patterns of word choice in many ideological texts, and have developed a statistical model that simultaneously captures two factors, <em>topical</em> and <em>ideological</em>, that contribute to the word choices made by authors holding contrasting ideological beliefs.</p>
<ul>
<li>Topical: Ideology is situated in a specific topic.  &#8220;Pro-life&#8221; ideology is relevant mostly in news articles about pregnancy but less relevant in articles about baseball. Some words will be chosen because they are about the topic.</li>
<li>Ideological: Authors or speakers holding different ideological beliefs emphasize some words (writing or speaking them more) and de-emphasize other words (writing or speaking them less) when they express ideological views on an issue.</li>
</ul>
<p>I thus call this statistical model for ideological discourse a Joint Topic and Perspective Model (jTP).</p>
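<p>To make the two factors concrete, here is a toy sketch in Python. It is not the model from the paper: purely for illustration, it assumes that each word has one topical weight shared by both sides and one ideological weight per side, that a word's probability is a softmax over the product of the two, and that a text is scored by a log-likelihood ratio. All weights and names are made up.</p>

```python
import math

# Hypothetical weights, invented for this example.
# topical[w]: how strongly word w belongs to the topic (shared by both sides).
topical = {"palestinian": 2.0, "israeli": 2.0, "terrorism": 1.0, "occupation": 1.0}
# ideological[side][w]: values above 1 emphasize w, values below 1 de-emphasize it.
ideological = {
    "israeli": {"palestinian": 1.0, "israeli": 1.0, "terrorism": 1.5, "occupation": 0.5},
    "palestinian": {"palestinian": 1.0, "israeli": 1.0, "terrorism": 0.5, "occupation": 1.5},
}


def word_distribution(side):
    """Softmax over topical[w] * ideological[side][w] (an assumed form)."""
    scores = {w: math.exp(topical[w] * ideological[side][w]) for w in topical}
    norm = sum(scores.values())
    return {w: s / norm for w, s in scores.items()}


def log_likelihood_ratio(words):
    """Positive values favor the Israeli side, negative the Palestinian side."""
    p_isr = word_distribution("israeli")
    p_pal = word_distribution("palestinian")
    # Words outside the toy vocabulary are ignored.
    return sum(math.log(p_isr[w] / p_pal[w]) for w in words if w in topical)


score_a = log_likelihood_ratio(["occupation", "occupation", "palestinian"])
score_b = log_likelihood_ratio(["terrorism", "israeli"])
```

<p>With these invented weights, a text that repeats &#8220;occupation&#8221; gets a negative score, while purely topical words such as &#8220;Palestinian&#8221; and &#8220;Israeli&#8221; contribute nothing because both sides use them equally.</p>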
<p>Here is an example of fitting jTP on the editorials about the Israeli-Palestinian conflict published on <a href="http://www.bitterlemons.org/" target="_blank">bitterlemons.org</a>. I summarize the topical and ideological factors uncovered by jTP in a color text cloud.  A word&#8217;s size is proportional to its topical factor (i.e., how much its occurrence is attributed to the topic), and a word&#8217;s color depth is proportional to its ideological factor (i.e., how much its occurrence is attributed to the ideological perspective of an author).  The &#8220;neutral&#8221; words that are not particularly emphasized by either side are painted light gray.  Words chosen more often by the Israeli authors are painted <span style="color:#ff0000;">red</span>, and words used more often by the Palestinian authors are painted <span style="color:#333399;">blue</span>.</p>
<p><img src="http://whatsthescoopweihao.files.wordpress.com/2008/06/israeli_palestinian_jtp1_text_cloud.png?w=347&#038;h=301" alt="" width="347" height="301" /><a href="http://whatsthescoopweihao.files.wordpress.com/2008/06/israeli_palestinian_jtp1_text_cloud.png"> </a></p>
<p>Topical words (in large size) such as &#8220;Palestinian&#8221; and &#8220;Israeli&#8221; are, not surprisingly, chosen very often by both sides.  These topical words, however, are not particularly emphasized by either side.  The Israeli and Palestinian perspectives are clearly reflected in their word choices.  The Israeli authors choose &#8220;terrorism&#8221; more often, while the Palestinian authors choose &#8220;occupation&#8221; and &#8220;resistance&#8221; more often.  Interestingly, &#8220;Arafat&#8221;, a former Palestinian leader, is mentioned more often by the Israeli authors than by the Palestinian authors.</p>
<p>Here is another example of fitting jTP on the speech transcripts of the 2000 and 2004 United States presidential debates.  Words emphasized by the Democratic presidential candidates are painted <span style="color:#ff0000;">red</span>, and words emphasized by the Republican presidential candidates are painted <span style="color:#333399;">blue</span>.</p>
<p><img src="http://whatsthescoopweihao.files.wordpress.com/2008/06/democrat_republican_jtp1_text_cloud.png?w=321&#038;h=265" alt="" width="321" height="265" /></p>
<p>The Democratic presidential candidates choose &#8220;families&#8221; and &#8220;kids&#8221; more often, while the Republican presidential candidates choose &#8220;freedom&#8221; and &#8220;Washington&#8221; more often.</p>
<p>These examples show that ideological beliefs are very much reflected in word choices made by an author or a speaker.  By modeling these statistical patterns, computers can &#8220;learn&#8221; how ideological perspectives are reflected in word choices from a large collection of documents.</p>
<p>You can find more details of the Joint Topic and Perspective Model in our <a title="Joint Topic and Perspective Paper" href="http://weihaolinatcmu.googlepages.com/lin08jtp.pdf">paper</a>, accepted at the upcoming 2008 European Conference on Machine Learning.</p>
]]></content:encoded>
			<wfw:commentRss>http://whatsthescoopweihao.wordpress.com/2008/06/26/joint-topic-and-perspective-model/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">Max</media:title>
		</media:content>

		<media:content url="http://whatsthescoopweihao.files.wordpress.com/2008/06/israeli_palestinian_jtp1_text_cloud.png" medium="image" />

		<media:content url="http://whatsthescoopweihao.files.wordpress.com/2008/06/democrat_republican_jtp1_text_cloud.png" medium="image" />
	</item>
		<item>
		<title>Hello, World</title>
		<link>http://whatsthescoopweihao.wordpress.com/2008/06/15/hello-world/</link>
		<comments>http://whatsthescoopweihao.wordpress.com/2008/06/15/hello-world/#comments</comments>
		<pubDate>Sun, 15 Jun 2008 17:17:58 +0000</pubDate>
		<dc:creator>Max</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://whatsthescoopweihao.wordpress.com/?p=5</guid>
		<description><![CDATA[One of my favorite constants is &#960;. I still don&#8217;t know how to tell my mom that the &#960; in the normal distribution actually has something to do with a circle.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=whatsthescoopweihao.wordpress.com&amp;blog=3985183&amp;post=5&amp;subd=whatsthescoopweihao&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>One of my favorite constants is <img src='http://s0.wp.com/latex.php?latex=%5Cpi&amp;bg=ffffff&amp;fg=1c1c1c&amp;s=0' alt='&#92;pi' title='&#92;pi' class='latex' />.  I still don&#8217;t know how to tell my mom that the <img src='http://s0.wp.com/latex.php?latex=%5Cpi&amp;bg=ffffff&amp;fg=1c1c1c&amp;s=0' alt='&#92;pi' title='&#92;pi' class='latex' /> in the normal distribution actually has something to do with a circle.</p>
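<p>One standard way to see the connection is to square the Gaussian integral and switch to polar coordinates, where the angle sweeps a full circle and contributes the factor of 2&#960;:</p>

```latex
\begin{aligned}
I &= \int_{-\infty}^{\infty} e^{-x^2/2}\,dx, \\
I^2 &= \int_{-\infty}^{\infty}\int_{-\infty}^{\infty} e^{-(x^2+y^2)/2}\,dx\,dy
     = \int_{0}^{2\pi}\int_{0}^{\infty} e^{-r^2/2}\,r\,dr\,d\theta
     = 2\pi \cdot 1 = 2\pi, \\
I &= \sqrt{2\pi}.
\end{aligned}
```

<p>So the standard normal density needs the factor 1/&#8730;(2&#960;) in order to integrate to one.</p>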
]]></content:encoded>
			<wfw:commentRss>http://whatsthescoopweihao.wordpress.com/2008/06/15/hello-world/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">Max</media:title>
		</media:content>
	</item>
	</channel>
</rss>
