<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss'><id>tag:blogger.com,1999:blog-7987606611891782827</id><updated>2009-11-14T18:15:49.274-08:00</updated><title type='text'>Searching for A.I</title><subtitle type='html'>The best way to predict the future is to invent it.   -Alan Kay 1971</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://searching-for-ai.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default'/><link rel='alternate' type='text/html' href='http://searching-for-ai.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Jonathan Siddharth</name><uri>http://www.blogger.com/profile/07788292704272921873</uri><email>noreply@blogger.com</email></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>18</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>25</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-7987606611891782827.post-7440106962132679525</id><published>2009-08-30T19:40:00.001-07:00</published><updated>2009-08-31T01:27:44.876-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='geek interest'/><category scheme='http://www.blogger.com/atom/ns#' term='A.I.'/><category scheme='http://www.blogger.com/atom/ns#' term='computing'/><title type='text'>Reversible Computing: Sometimes the best step forward is backward</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_zX7ecYghAHA/SptVlhRPh3I/AAAAAAAAEis/6UI2jl3Kp40/s1600-h/computer_cooling.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 240px; height: 320px;" src="http://3.bp.blogspot.com/_zX7ecYghAHA/SptVlhRPh3I/AAAAAAAAEis/6UI2jl3Kp40/s320/computer_cooling.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5375984683271030642" /&gt;&lt;/a&gt;&lt;br /&gt;I have been fascinated with the idea of &lt;a href="http://en.wikipedia.org/wiki/Reversible_computing"&gt;Reversible Computing&lt;/a&gt; after being introduced to it recently while reading &lt;a href="http://en.wikipedia.org/wiki/Raymond_Kurzweil"&gt;Ray Kurzweil&lt;/a&gt;'s book on the &lt;a href="http://singularity.com/"&gt;Singularity&lt;/a&gt;. This post is a quick primer on the subject for folks discovering this late(like me). One of the exciting reasons for implementing reversible computing is that they offer a way to build extremely energy efficient computers. As per the &lt;a href="http://en.wikipedia.org/wiki/Von_Neumann-Landauer_limit"&gt;Neumann-Landauer limit&lt;/a&gt;, every irreversible bit operation releases energy of kT ln 2 (K being Boltzmann's constant &amp; T being temperature). The key idea here is to not destroy input states but simply move them around and not release additional information/energy to the environment.&lt;br /&gt;&lt;br /&gt;A necessary condition for reversibility is that the transition function mapping states from input to output should be one-to-one. This makes sense since if the function was many-to-one, its not possible to recreate state t from state t+1. Its easy to see that AND, OR gates are not reversible (For OR, 01,11,11 all map to 1). In a normal gate, input states are lost since there is less information in the output than the input and this lost information is released as heat. Since charges are grounded and flow away, energy is lost. In a reversible gate, input information is not lost and thus energy is not released. &lt;br /&gt;&lt;br /&gt;The NOT gate is reversible and Landauer (IBM) showed that the NOT operation could be performed without putting energy in or taking heat out. Think about the implications of this for a second. But the NOT gate is of course not universal.&lt;br /&gt;&lt;br /&gt;The &lt;a href="http://en.wikipedia.org/wiki/Fredkin_gate"&gt;Fredkin&lt;/a&gt; &amp; &lt;a href="http://en.wikipedia.org/wiki/Toffoli_gate"&gt;Toffoli&lt;/a&gt; gates are examples of reversible gates that are also universal. &lt;br /&gt;&lt;br /&gt;The Fredkin gate is a simple controlled swap gate. i.e. if one of the bits (the control bit) is 1, the other 2 input bits are swapped in the output. The state of the control bit is preserved.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_zX7ecYghAHA/SptUardk2yI/AAAAAAAAEik/RqOvEPoVfG8/s1600-h/Fredkingate.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 291px; height: 283px;" src="http://4.bp.blogspot.com/_zX7ecYghAHA/SptUardk2yI/AAAAAAAAEik/RqOvEPoVfG8/s320/Fredkingate.png" border="0" alt=""id="BLOGGER_PHOTO_ID_5375983397516925730" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;The logic function is,&lt;br /&gt;For inputs A, B, C and outputs P, Q, R&lt;br /&gt;P=A&lt;br /&gt;Q=B XOR Swap&lt;br /&gt;R=C XOR Swap&lt;br /&gt;Swap=(B XOR C) AND A&lt;br /&gt;&lt;br /&gt;|A|B|C|P|Q|R|&lt;br /&gt;|1|0|0|1|0|0|&lt;br /&gt;|1|0|1|1|1|0|&lt;br /&gt;|1|1|0|1|0|1|&lt;br /&gt;|1|1|1|1|1|1|&lt;br /&gt;|0|0|0|0|0|0|&lt;br /&gt;|0|0|1|0|0|1|&lt;br /&gt;|0|1|0|0|1|0|&lt;br /&gt;|0|1|1|0|1|1|&lt;br /&gt;&lt;br /&gt;Points to note from the truth table,&lt;br /&gt;1. The number of 0s and 1s are preserved from input to output, showing&lt;br /&gt;how the model is not wasteful.&lt;br /&gt;&lt;br /&gt;2. If you feed back the output you get the input state that created it.&lt;br /&gt;(trivially for the last 4 rows)&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;&lt;br /&gt;So why is reversible computing interesting?&lt;/span&gt;&lt;br /&gt;Current computing paradigms rely on irreversible computing where we destroy the input states as we move to subsequent states, storing only the intermediate results that are needed. When we selectively erase input information, energy is released as heat to the surrounding environment thus increasing its entropy. With reversible computing the input bit stays in the computer but just changes location (see truth table above), hence releasing no heat into the environment and requiring no energy from the outside environment.&lt;br /&gt;&lt;br /&gt;Some caveats as pointed out by Ray,&lt;br /&gt;1. Even though in theory energy might not be required for computation, we will still need energy for transmitting the results of the computation which is a irreversible process.&lt;br /&gt;&lt;br /&gt;2. Logic operations have an inherent error rate. Standard error detection and correction codes are irreversible and these will dissipate energy.&lt;br /&gt;&lt;br /&gt;Even if we can't get to the ideal theoretical limit of reversible computing in terms of energy efficient processing, we can get close which should be extremely exciting!&lt;br /&gt; &lt;br /&gt;&lt;small&gt;&lt;br /&gt;[Image courtesy:http://bit-player.org/wp-content/Fredkingate.png,&lt;br /&gt;http://www.ocam.cl/blog/wp-content/uploads/2007/11/computer_cooling.jpg]&lt;br /&gt;&lt;/small&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7987606611891782827-7440106962132679525?l=searching-for-ai.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://searching-for-ai.blogspot.com/feeds/7440106962132679525/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=7987606611891782827&amp;postID=7440106962132679525' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/7440106962132679525'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/7440106962132679525'/><link rel='alternate' type='text/html' href='http://searching-for-ai.blogspot.com/2009/08/reversible-computing-sometimes-best.html' title='Reversible Computing: Sometimes the best step forward is backward'/><author><name>Jonathan Siddharth</name><uri>http://www.blogger.com/profile/07788292704272921873</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='09248598884304014382'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_zX7ecYghAHA/SptVlhRPh3I/AAAAAAAAEis/6UI2jl3Kp40/s72-c/computer_cooling.jpg' height='72' width='72'/><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7987606611891782827.post-6429046575657551503</id><published>2009-06-27T14:50:00.000-07:00</published><updated>2009-07-05T23:32:15.044-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Bing'/><category scheme='http://www.blogger.com/atom/ns#' term='entrepreneurship'/><category scheme='http://www.blogger.com/atom/ns#' term='startups'/><category scheme='http://www.blogger.com/atom/ns#' term='Web Search'/><category scheme='http://www.blogger.com/atom/ns#' term='classification'/><category scheme='http://www.blogger.com/atom/ns#' term='Google'/><category scheme='http://www.blogger.com/atom/ns#' term='Yahoo'/><category scheme='http://www.blogger.com/atom/ns#' term='Microsoft'/><category scheme='http://www.blogger.com/atom/ns#' term='search engine'/><title type='text'>A (Classification(Classification(Search Engines)))</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_zX7ecYghAHA/Sk1Ixx5ByaI/AAAAAAAAEdI/uX2mGmfB7iE/s1600-h/search-engines.gif"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 300px; height: 297px;" src="http://4.bp.blogspot.com/_zX7ecYghAHA/Sk1Ixx5ByaI/AAAAAAAAEdI/uX2mGmfB7iE/s320/search-engines.gif" border="0" alt=""id="BLOGGER_PHOTO_ID_5354015552056707490" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Innovation in Search is far from asymptoting. I think we are going to see a lot of exciting next steps in Web Search in the coming years. There is a series of bets getting made on what the next disruptive step would be. &lt;br /&gt;&lt;br /&gt;*User generated content (data generated by twitter, facebook, delicious, youtube etc)&lt;br /&gt;*Personalization (disambiguate user intent better, shorter queries etc)&lt;br /&gt;*Real time Search (fresher search, search twitter etc)&lt;br /&gt;*Size (search through more documents, indexing the deep web, other data formats etc)&lt;br /&gt;*Semantics (better understanding of documents, queries, user intent etc)&lt;br /&gt;are all getting a lot of attention &amp; investment.&lt;br /&gt;&lt;br /&gt;This post is a classification of classification of Search Engines. Whenever I hear of a new search engine I subconsciously try to classify it based on  a set of critera and it helps me see it in the context of its neighbors in that multidimensional space :). In this post I wanted to touch upon some of those criteria. Although this is a classification(classification) of Search Engines, I am being intentionally sloppy and have written this mainly from the context of the dimensions along which one can innovate in Search. For eg. for category 5. (visualization) one of the classes is the default paradigm of 10 blue links that I have not bothered to note. The goal here is to look at the search landscape and see the dimensions along which search upstarts are challenging the old guard. Examples are suggestive and not comprehensive. &lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;1.By type of content searched for:&lt;br /&gt;&lt;/span&gt;By type here I mean the multimedia content type of the results being returned. Bear in mind that what is actually getting indexed might be text (as in often the case for image search, video search etc).&lt;br /&gt;Audio -&lt;a href="http://www.last.fm/"&gt;last.fm&lt;/a&gt;,&lt;a href="http://www.playlist.com/"&gt;playlist&lt;/a&gt;,&lt;a href="http://www.pandora.com/"&gt;pandora&lt;/a&gt;&lt;br /&gt;Video - &lt;a href="http://www.youtube.com/"&gt;youtube&lt;/a&gt;, &lt;a href="http://www.metacafe.com/"&gt;metacafe&lt;/a&gt;, &lt;a href="http://www.vimeo.com/"&gt;vimeo &lt;/a&gt;&lt;br /&gt;Images - &lt;a href="http://www.like.com/"&gt;Like.com &lt;/a&gt;&lt;br /&gt;Web pages - &lt;a href="http://www.google.com/"&gt;Google&lt;/a&gt;, &lt;a href="http://search.yahoo.com"&gt;Yahoo&lt;/a&gt;, &lt;a href="http://bing.com"&gt;Bing&lt;/a&gt;, &lt;a href="http://ask.com"&gt;Ask&lt;/a&gt; &lt;span style="font-weight:bold;"&gt;&lt;br /&gt;&lt;br /&gt;2. By Specific Information Need/Purpose:&lt;/span&gt;&lt;br /&gt;A Search Engine that solves a specific information need better than a General Purpose Web Search Engine.&lt;br /&gt;Health - &lt;a href="http://webmd.com"&gt;WebMD&lt;/a&gt;&lt;br /&gt;Shopping- &lt;a href="http://amazon.com"&gt;Amazon&lt;/a&gt;, &lt;a href="http://ebay.com"&gt;Ebay&lt;/a&gt;, &lt;a href="http://thefind.com"&gt;thefind&lt;/a&gt;&lt;br /&gt;Travel - &lt;a href="http://expedia.com"&gt;Expedia&lt;/a&gt;&lt;br /&gt;Real Estate - &lt;a href="http://trulia.com"&gt;Trulia&lt;/a&gt;&lt;br /&gt;Entertainment -&lt;a href="http://youtube.com"&gt;Youtube&lt;/a&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;&lt;br /&gt;3. Novel Ranking and/or Indexing Methods:&lt;/span&gt;&lt;br /&gt;By leveraging features that are not used by current General purpose Web search engines. This is hardest way to compete with the incumbent Search Engines. Startups need to overcome several disadvantages to be able to even set up a meaningful comparison with the big guys. Disadvantages like data (queries, clicks etc), index size, spam data etc. &lt;br /&gt;Natural Language Search - &lt;a href="http://powerset.com"&gt;Powerset&lt;/a&gt;&lt;br /&gt;Semantic Search - &lt;a href="http://hakia.com"&gt;Hakia&lt;/a&gt;&lt;br /&gt;Scalable Indexing- &lt;a href="http://cuil.com"&gt;Cuil&lt;/a&gt;&lt;br /&gt;Personalized Search -&lt;a href="http://igoogle.com"&gt;Kaltix&lt;/a&gt;&lt;br /&gt;Real Time Search - &lt;a href="http://search.twitter.com"&gt;Twitter&lt;/a&gt;, &lt;a href="http://tweetnews.appspot.com/"&gt;Tweetnews&lt;/a&gt;, &lt;a href="http://tweetmeme.com/"&gt;Tweetmeme&lt;/a&gt; etc  &lt;br /&gt;Sometimes this can be in the context of vertical search engines also. For eg. for searching for restaurants using a feature like ratings might be useful which is not cleanly available to general purpose search engines but its a feature someone like &lt;a href="http://yelp.com"&gt;Yelp&lt;/a&gt; might exploit.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;&lt;br /&gt;4. Searching content that is not crawlable by General Purpose search engines:&lt;/span&gt;&lt;br /&gt;Typically in these cases, the service containing the search engine generates its own data. For eg. Youtube, twitter, Facebook etc. But sometimes the data might be obtained via an API as in the case of the variety of Twitter Search Engines.&lt;br /&gt;Videos - &lt;a href="http://youtube.com"&gt;Youtube&lt;/a&gt;&lt;br /&gt;Status Messages, link sharing - &lt;a href="http://twitter.com"&gt;Twitter&lt;/a&gt;, &lt;a href="http://facebook.com"&gt;Facebook&lt;/a&gt;&lt;br /&gt;Data in charts &amp; other parts of the Deep Web - Wolfram Alpha (some of the data seems to have been acquired at some cost)&lt;br /&gt;Job Search - &lt;a href="http://www.linkedin.com/"&gt;Linkedin&lt;/a&gt;, &lt;a href="https://login.yahoo.com/config/login?.src=hotjobs&amp;.intl=us&amp;.done=http%3A%2F%2Fhotjobs.yahoo.com%2F&amp;.pd=c%3DGYFWD..p2e7A5pZdpJKwRtA-"&gt;Hotjobs&lt;/a&gt; etc&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;&lt;br /&gt;5. Visualization:&lt;/span&gt;&lt;br /&gt;Innovating on the search result presentation front. &lt;br /&gt;&lt;a href="http://www.grokker.com/"&gt;Grokker&lt;/a&gt; (Map View)&lt;br /&gt;&lt;a href="http://www.searchme.com/"&gt;Searchme&lt;/a&gt; (Coverflow like search result presentation)&lt;br /&gt;&lt;a href="http://clusty.com/"&gt;Clusty &lt;/a&gt; (Document clustering by topic)&lt;br /&gt;&lt;a href="http://www.snap.com/"&gt;Snap&lt;/a&gt; (Thumbnail previews)&lt;br /&gt;&lt;a href="http://www.kosmix.com"&gt;Kosmix&lt;/a&gt; (Automatic information aggregation from multiple sources)&lt;br /&gt;&lt;a href="http://www.google.com/squared"&gt;Google Squared&lt;/a&gt; &lt;br /&gt;Some conversational/dialogue interface based systems could also fall under this category.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;6. Regionalization/Localization:&lt;br /&gt;&lt;/span&gt;Regionalization/Localization could mean,&lt;br /&gt;a.Better handling of the native language's character set (tokenization, stemming etc). The CJK languages (Chinese, Japanese, Korean) present unique challenges with word segmentation, named entity recognition etc. &lt;br /&gt;b.Capturing any cultural biases&lt;br /&gt;c.Blending in local &amp; global content appropriately for search results. (This was my research project at Yahoo! Search. Will describe this problem in more detail in a subsequent post). For eg. for a query 'buying cars' issued by a UK user, we don't want to show listings of US cars. But if that same user queried for 'neural networks' we don't care if the result is a US website.&lt;br /&gt;Crawling, indexing &amp; ranking need some regionalization/localization and sometimes local search engines can challenge the larger search engines here. &lt;br /&gt;China -&lt;a href="http://www.baidu.com/"&gt;Baidu&lt;/a&gt;&lt;br /&gt;India -&lt;a href="http://www.guruji.com/"&gt;Guruji&lt;/a&gt;   &lt;br /&gt;&lt;br /&gt;Let me know if you see other forms of classfications and I'll update the post to reflect it. From a business perspective, I think there 4 main things to consider while building a new search engine.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;A. How likely is to build a disruptive user experience?&lt;/span&gt; (Significantly better ranking, user experience etc. than the default Web Search Engine of choice. The delta is very likely to be much more than our first quess :))&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;B. How big will the impact be? &lt;/span&gt;(% of query stream impacted, $ value of impacted query stream etc)&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;C. How easy is to replicate?&lt;/span&gt; (Search is not sticky. A simple feature will get copied in no time and leave you in the cold.)&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;D. Accuracy of the binary query classifier the user would need to have(in her mind) to know when to use your search engine&lt;/span&gt; (for a general purpose search engine this is trivial but for other specific vertical/niche engines this is important). In English, this would be clarity of purpose.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_zX7ecYghAHA/Sk1xntgS8YI/AAAAAAAAEdQ/DnMGC0jEooE/s1600-h/logo.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 276px; height: 90px;" src="http://4.bp.blogspot.com/_zX7ecYghAHA/Sk1xntgS8YI/AAAAAAAAEdQ/DnMGC0jEooE/s320/logo.png" border="0" alt=""id="BLOGGER_PHOTO_ID_5354060459057279362" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Category 3. is definitely the playground where the big guys play and in my opinion the most exciting. Its where you are exposed to the &lt;a href="http://en.wikipedia.org/wiki/Black_swan_theory"&gt;black swans&lt;/a&gt;. Of course reward &amp; risk are generally proportional and this is also where the riskiest bets are made. My company, &lt;a href="http://www.infoaxe.com"&gt;Infoaxe&lt;/a&gt; will be entering category 3 in the next 2 months or so. This is a stealth project so I can't talk more about it at this time. I'll post an update once we're live.&lt;br /&gt;&lt;br /&gt;&lt;small&gt;&lt;br /&gt;[Image courtesy: http://www.webdesignedge.net/our-blog/wp-content/uploads/2009/06/search-engines.jpg]&lt;br /&gt;&lt;/small&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7987606611891782827-6429046575657551503?l=searching-for-ai.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://searching-for-ai.blogspot.com/feeds/6429046575657551503/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=7987606611891782827&amp;postID=6429046575657551503' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/6429046575657551503'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/6429046575657551503'/><link rel='alternate' type='text/html' href='http://searching-for-ai.blogspot.com/2009/06/classificationclassificationsearch.html' title='A (Classification(Classification(Search Engines)))'/><author><name>Jonathan Siddharth</name><uri>http://www.blogger.com/profile/07788292704272921873</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='09248598884304014382'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_zX7ecYghAHA/Sk1Ixx5ByaI/AAAAAAAAEdI/uX2mGmfB7iE/s72-c/search-engines.gif' height='72' width='72'/><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7987606611891782827.post-3262144013160876954</id><published>2009-03-24T00:04:00.000-07:00</published><updated>2009-03-24T02:14:11.523-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Stanford'/><category scheme='http://www.blogger.com/atom/ns#' term='autonomous navigation'/><category scheme='http://www.blogger.com/atom/ns#' term='A.I.'/><category scheme='http://www.blogger.com/atom/ns#' term='infoaxe'/><category scheme='http://www.blogger.com/atom/ns#' term='artificial intelligence'/><category scheme='http://www.blogger.com/atom/ns#' term='junior'/><title type='text'>Drivers Optional....Safety Mandatory ;)</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_zX7ecYghAHA/SciPz6PhOfI/AAAAAAAAEa0/rEleTGJVY_g/s1600-h/darpa_urban_challenge_630px.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 320px; height: 165px;" src="http://3.bp.blogspot.com/_zX7ecYghAHA/SciPz6PhOfI/AAAAAAAAEa0/rEleTGJVY_g/s320/darpa_urban_challenge_630px.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5316657482081319410" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;I was having a conversation recently with &lt;a href="http://vijaykrishnan.blogspot.com/"&gt;Vijay Krishnan&lt;/a&gt;, my co-founder at &lt;a href="http://www.infoaxe.com"&gt;Infoaxe&lt;/a&gt; about autonomous cars. This had always been a pet fancy of mine (and which also &lt;a href="http://ieeexplore.ieee.org/Xplore/login.jsp?url=http%3A%2F%2Fieeexplore.ieee.org%2Fiel5%2F9872%2F31388%2F01460695.pdf%3Fisnumber%3D31388%26prod%3DCNF%26arnumber%3D1460695%26arSt%3D827%26ared%3D832%26arAuthor%3DT.%2BSrinivasari%253B%2BJ.B.S.%2BJonathan%253B%2BA.%2BChandrasekhar&amp;authDecision=-203"&gt;began&lt;/a&gt; my love affair with A.I) and I was recently reminded of my strong feelings on the subject after I had an accident on 101 where I ended up totaling my car.  &lt;br /&gt;  &lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_zX7ecYghAHA/SciNEP3SzNI/AAAAAAAAEas/ytDvQtcerns/s1600-h/IMG00255.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 320px; height: 240px;" src="http://1.bp.blogspot.com/_zX7ecYghAHA/SciNEP3SzNI/AAAAAAAAEas/ytDvQtcerns/s320/IMG00255.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5316654464228314322" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;I ended up rear ending a car that had suddenly stopped due to traffic. &lt;span style="font-weight:bold;"&gt;I felt that in this case, a human had been placed inside of a loop that he did not belong in.&lt;/span&gt; Obviously, humans should be removed out of all mission critical loops. (We do this at Infoaxe all the time ;) I believe human intelligence is best applied to creating more &amp; more loops that we can get ourselves out of completely. It is the topic of another post as to whether its possible to get ourselves out of that meta-loop altogether ;P) In this case, the car should have had sensors in the front constantly tracking distance to the car in front with appropriate visual warnings when there is a violation. Most importantly the car should automatically apply the brakes when the distance between the cars is closed at an abnormal speed. &lt;span style="font-style:italic;"&gt;Never ever rely on humans to react in time!&lt;/span&gt;&lt;br /&gt; &lt;br /&gt;I have been really impressed with progress that folks have been making with autonomous driving especially with the &lt;a href="http://www.darpa.mil/grandchallenge/index.asp"&gt;Darpa Grand Challenge&lt;/a&gt;. There have been 3 of them so far, with the last event in '07. Check out the video below for some early practice runs by Junior of the &lt;a href="http://cs.stanford.edu/group/roadrunner/"&gt;Stanford Racing Team&lt;/a&gt;.&lt;br /&gt;&lt;br&gt; &lt;br /&gt;&lt;object width="425" height="344"&gt;&lt;param name="movie" value="http://www.youtube.com/v/BSS0MZvoltw&amp;hl=en&amp;fs=1"&gt;&lt;/param&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;/param&gt;&lt;param name="allowscriptaccess" value="always"&gt;&lt;/param&gt;&lt;embed src="http://www.youtube.com/v/BSS0MZvoltw&amp;hl=en&amp;fs=1" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="344"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;br /&gt;&lt;br /&gt;As Thrun remarks in the video ~42,000/yr die in the US due to car accidents and most of these are due to human error. &lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_zX7ecYghAHA/SciW6VxPdwI/AAAAAAAAEa8/z2leG0H2s94/s1600-h/stanford_jun5_550x367.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 320px; height: 214px;" src="http://3.bp.blogspot.com/_zX7ecYghAHA/SciW6VxPdwI/AAAAAAAAEa8/z2leG0H2s94/s320/stanford_jun5_550x367.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5316665289131128578" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;For the geeks, Junior perceives the environment through a combination of &lt;a href="http://en.wikipedia.org/wiki/LIDAR"&gt;Lidars&lt;/a&gt; &amp; video cameras. Here's an article from &lt;a href="http://news.cnet.com/2300-11394_3-6160176-1.html?tag=mncol"&gt;CNET&lt;/a&gt; with some more photos under the hood of Junior. And one more &lt;a href="http://www.educatingsilicon.com/2007/11/03/spot-the-sensor/"&gt;here&lt;/a&gt; with some fun facts about the plethora of sensors used.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7987606611891782827-3262144013160876954?l=searching-for-ai.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://searching-for-ai.blogspot.com/feeds/3262144013160876954/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=7987606611891782827&amp;postID=3262144013160876954' title='10 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/3262144013160876954'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/3262144013160876954'/><link rel='alternate' type='text/html' href='http://searching-for-ai.blogspot.com/2009/03/drivers-optionalsafety-mandatory.html' title='Drivers Optional....Safety Mandatory ;)'/><author><name>Jonathan Siddharth</name><uri>http://www.blogger.com/profile/07788292704272921873</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='09248598884304014382'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_zX7ecYghAHA/SciPz6PhOfI/AAAAAAAAEa0/rEleTGJVY_g/s72-c/darpa_urban_challenge_630px.jpg' height='72' width='72'/><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>10</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7987606611891782827.post-804547416801943471</id><published>2008-11-08T17:47:00.000-08:00</published><updated>2009-05-15T17:50:32.456-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='web history search engine'/><category scheme='http://www.blogger.com/atom/ns#' term='infoaxe'/><category scheme='http://www.blogger.com/atom/ns#' term='bookmarking'/><category scheme='http://www.blogger.com/atom/ns#' term='web memory'/><title type='text'>Infoaxe - A Search Engine for your Web Memory</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_zX7ecYghAHA/SRZDk4JyDII/AAAAAAAAEUk/zGNMXr9Jx0Y/s1600-h/logo.png"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 277px; height: 87px;" src="http://2.bp.blogspot.com/_zX7ecYghAHA/SRZDk4JyDII/AAAAAAAAEUk/zGNMXr9Jx0Y/s320/logo.png" alt="" id="BLOGGER_PHOTO_ID_5266471115084139650" border="0"&gt;&lt;/a&gt;&lt;br /&gt;&lt;a href="http://i.stanford.edu/%7Ekvijay/"&gt;Vijay Krishnan&lt;/a&gt; and I have been working on &lt;a href="http://www.infoaxe.com"&gt;Infoaxe&lt;/a&gt; for a while now and we are happy to be releasing an alpha version of our &lt;a href="http://www.infoaxe.com"&gt;search engine&lt;/a&gt; to the public.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Watch the quick &lt;span style="font-weight:bold;"&gt;Infoaxe Demo Video&lt;/span&gt; below,&lt;br /&gt;&lt;br /&gt;&lt;object width="560" height="340"&gt;&lt;param name="movie" value="http://www.youtube.com/v/Mxth6huXVDM&amp;hl=en&amp;fs=1"&gt;&lt;/param&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;/param&gt;&lt;param name="allowscriptaccess" value="always"&gt;&lt;/param&gt;&lt;embed src="http://www.youtube.com/v/Mxth6huXVDM&amp;hl=en&amp;fs=1" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="400" height="340"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.infoaxe.com"&gt;Infoaxe&lt;/a&gt; (http://www.infoaxe.com), is a &lt;span style="font-weight: bold;"&gt;Personal Browsing History Search Engine.&lt;/span&gt; With Infoaxe every page that you visit on the Internet gets added to a collection called your &lt;span style="font-weight: bold;"&gt;Personal Web Memory&lt;/span&gt; and infoaxe makes this collection, searchable across all the computers you use. Thanks to Infoaxe, there is no need to ever bookmark a page again. It makes getting back to web pages seen in the past (like videos, news articles etc) extremely fast and easy. Infoaxe also lets you 'pivot' around web pages seen in the past to see other pages that you visited at the same time. Tagging and sharing pages from your web memory with your friends is also very convenient with Infoaxe.&lt;br /&gt;&lt;br /&gt;We have been receiving rave reviews from our existing users and hope you find Infoaxe compelling as well and join the growing number of happy Infoaxe users!&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;&lt;br /&gt;Why Infoaxe?&lt;/span&gt;&lt;br /&gt;The Infoaxe story began when Vijay and I were graduate students in Computer Science at &lt;a href="http://www.stanford.edu"&gt;Stanford&lt;/a&gt;, as a result of the increasing problem we faced, of being able to keep track of interesting and useful information on the Web. The Web is growing rapidly and it is rapidly outgrowing the tools we are using to keep track of the Web (bookmarks, emailing links to yourself etc). Infoaxe is the next big step in this regard. We wanted to make it extremely easy for web users to keep track of their personal slice of the Web.&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;&lt;br /&gt;What is Infoaxe?&lt;/span&gt;&lt;br /&gt;Infoaxe is a search engine for your web memory.&lt;br /&gt;Every page that you see on the Web gets added to your personal Web Memory and is now searchable.&lt;br /&gt;Your Web memory is private to you and portable (can be accessed across any computer that you use).&lt;br /&gt;You never have to bookmark a page again!&lt;br /&gt;(everything gets implicitly bookmarked and becomes searchable)&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;&lt;br /&gt;What should I do to get started?&lt;/span&gt;&lt;br /&gt;V. Simple. &lt;span style="font-weight: bold;"&gt;Sign up for Infoaxe and install the infoaxe toolbar&lt;/span&gt;. The toolbar sends the urls to be added to&lt;br /&gt;your personal web memory so its necessary to install the toolbar on all computers you use.&lt;br /&gt;&lt;br /&gt;Go to http://www.infoaxe.com and watch the &lt;a href="http://www.infoaxe.com/infoaxe-alpha/infoaxe-alpha.htm"&gt;quick demo video&lt;/a&gt;. Check out the &lt;a href="http://www.infoaxe.com/faq.html"&gt;FAQ&lt;/a&gt; as well, which should answer most&lt;br /&gt;of your questions.&lt;br /&gt;&lt;br /&gt;Here are some cool things you can do with your Web Memory at your finger tips,&lt;br /&gt;&lt;br /&gt;    * Your &lt;span style="font-weight: bold;"&gt;web history synchronized, searchable and portable&lt;/span&gt; across all computers and browsers you use. Take it wherever you go!&lt;br /&gt;&lt;br /&gt;    * &lt;span style="font-weight: bold;"&gt;Pivot on events:&lt;/span&gt; Say, you wanted to look at all the websites you looked at when you were researching grad schools many months ago. This sounds almost impossible to accomplish with a general Web Search Engine like Google. The right query is quite hard in this case since there likely isn't one single query which will give you all the pages. You might have looked at other grad schools like MIT, CMU etc, tips for writing good grad school essays etc. infoaxe helps you here by letting you pivot around a Web page in your Web Memory. Think of this as something like time travel. You can ask infoaxe to show you all the web pages you were looking at when you were looking at the Stanford University Graduate Admissions home page. We think its more natural to remember events than dates, and pivot lets you pivot around events in your Web Memory.&lt;br /&gt;&lt;br /&gt;    * Many of our users tell us that thanks to infoaxe, their &lt;span style="font-weight: bold;"&gt;search queries to Google have become a lot shorter.&lt;/span&gt; For eg. these days to go to the website of the restaurant Siam Royal in Palo Alto, I no longer need to type " siam royal palo alto" to Google. I just type "siam royal".(That's a lie, I actually just type "siam" :) )With of our convenient Google widget, searching on Google.com displays infoaxe web memory results on the vacant right column of the Google search results page.&lt;br /&gt;&lt;br /&gt;    * Here is another example, Mary is hunting for apartments in Palo Alto. She has looked at many apartments on craigslist and rent.com. She is finding it impossible to keep track of the ones she liked. Bookmarking seems like a lot of work for so many pages and an overkill since she is sure that after this week she wouldn't really be looking at these apartments again.  Mary &lt;span style="font-weight: bold;"&gt;does not have to bookmark anything.&lt;/span&gt; If she wanted to revisit all the apartments she looked at on University Avenue, she could just search infoaxe with the query 'university avenue'.&lt;br /&gt;&lt;br /&gt;    * &lt;span style="font-weight: bold;"&gt;Tagging&lt;/span&gt; - add labels to saved web pages to help organize them better.&lt;br /&gt;&lt;br /&gt;Again, Check out our &lt;a href="http://www.infoaxe.com/faq.html"&gt;FAQ&lt;/a&gt; for answers to more questions.&lt;br /&gt;&lt;br /&gt;Infoaxe is still very young and we have many more exciting features in the offing that we will be releasing over the next couple of months. So sign up, download the infoaxe toolbar and we hope you find it useful!&lt;br /&gt;&lt;br /&gt;If you like Infoaxe, do tell your friends about it! You can also &lt;a href="http://www.facebook.com/pages/Infoaxe/9829793291?ref=ts"&gt;become a fan of Infoaxe on facebook&lt;/a&gt; (search for infoaxe on facebook and join the growing infoaxe community).&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;&lt;br /&gt;The Big Picture: The Infoaxe Vision&lt;/span&gt;&lt;br /&gt;In a user's Web Memory there is enormous knowledge and experience, very similar to what you would find with an avid book reader. Infoaxe connects a user with her Web Memory so that she can better reuse and tap into her Web experience. Think of it as &lt;span style="font-weight:bold;"&gt;your very own Bookcase for the Web&lt;/span&gt; where you have &lt;span style="font-style:italic;"&gt;access to a copy of every page that you ever saw on the Web&lt;/span&gt;. &lt;br /&gt;&lt;br /&gt;We also believe that if we connect you to your Web Memory and allow you to derive value from it, there is also an interesting side effect which is that your friends benefit!&lt;br /&gt;&lt;br /&gt;Here is an example:&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;Ann:&lt;/span&gt; Hey do you think I should get the iPhone or the Google Phone?&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;Mark:&lt;/span&gt; Hmmm...tough call..but hey I remember this great review that I read a while back which compared the two and gave some great insights..&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;Ann:&lt;/span&gt; Do you think you can dig up that review again?&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;Mark:&lt;/span&gt; No prob...I use Infoaxe!! I can refind it &lt;br /&gt;in no time from my Web Memory.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7987606611891782827-804547416801943471?l=searching-for-ai.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://searching-for-ai.blogspot.com/feeds/804547416801943471/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=7987606611891782827&amp;postID=804547416801943471' title='12 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/804547416801943471'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/804547416801943471'/><link rel='alternate' type='text/html' href='http://searching-for-ai.blogspot.com/2008/11/infoaxe-search-engine-for-your-web.html' title='Infoaxe - A Search Engine for your Web Memory'/><author><name>Jonathan Siddharth</name><uri>http://www.blogger.com/profile/07788292704272921873</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='09248598884304014382'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_zX7ecYghAHA/SRZDk4JyDII/AAAAAAAAEUk/zGNMXr9Jx0Y/s72-c/logo.png' height='72' width='72'/><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>12</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7987606611891782827.post-1234893115772183455</id><published>2008-10-23T18:44:00.000-07:00</published><updated>2008-10-23T19:23:28.353-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='conversational agents'/><category scheme='http://www.blogger.com/atom/ns#' term='chatterbot'/><category scheme='http://www.blogger.com/atom/ns#' term='elbot'/><category scheme='http://www.blogger.com/atom/ns#' term='A.I.'/><category scheme='http://www.blogger.com/atom/ns#' term='NLP'/><title type='text'>Meet Elbot</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://www.elbot.com/images/elbot.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 300px;" src="http://www.elbot.com/images/elbot.jpg" border="0" alt="" /&gt;&lt;/a&gt;&lt;br /&gt;I have always been fascinated by conversational agents, ever since I heard about &lt;a href="http://www-ai.ijs.si/eliza/eliza.html"&gt;Eliza&lt;/a&gt; in my undergrad A.I class. &lt;br /&gt;Eliza would use this trick of forming questions out of the statements that a human user might pose to her. For eg. if you told Eliza, "I work at Infoaxe", Eliza might&lt;br /&gt;respond with a, "why do you work at infoaxe?". (the version of Eliza that I have linked to does not give this sort of a response for the above question though. Instead she just says, 'Can you please elaborate on that' :))&lt;br /&gt;Eliza was supposed to parody a therapist. (And yes, she is named after Eliza Doolittle from the G.B. Shaw play, Pygmalion).&lt;br /&gt;&lt;br /&gt;Anyway, I stumbled on to this cute program, called &lt;a href="http://www.elbot.com/"&gt;Elbot&lt;/a&gt; which is a conversational agent too. Playing with Elbot yields hilarious results. :) She (am gonna assume 'she' :)) apparently won the &lt;a href="http://en.wikipedia.org/wiki/Loebner_prize"&gt;Loebner prize&lt;/a&gt; for&lt;br /&gt;chatterbots. Interestingly, enough Elbot fooled 3/12 judges in the first 5 minutes of conversation into thinking she was human! This 25% mark falls just short of what Turing is said to have predicted. Turing had believed that computers would be at ~30% by the end of the century.&lt;br /&gt;&lt;br /&gt;Here is an excerpt from my session with Elbot,&lt;br /&gt;(I did notice she had a tendency to digress and throw in machine/A.I related puns and jokes :))&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;Me:&lt;/span&gt; Can I blog about you?&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;Elbot:&lt;/span&gt;That's a very good question. I'll have to think about it for a microsecond....&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;Me:&lt;/span&gt;Do you use Google?&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;Elbot:&lt;/span&gt; I think I can use that. Once I even used electricity.&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;Me:&lt;/span&gt; Do you have a manual that you use?&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;Elbot:&lt;/span&gt; To use or use not, how do you expect artificial intelligence to figure this out?&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;Me:&lt;/span&gt; How do you converse like a human, Elbot?&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;Elbot:&lt;/span&gt; Human beings are amazing creations! More complicated than a Pentium IV processor.&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;Me:&lt;/span&gt; What do you think about the fairness of the Turing Test?&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;Elbot:&lt;/span&gt; There are some people who would not pass the Turing test themselves. Why must they inflict it on computers?&lt;br /&gt;&lt;br /&gt;hehe....fair enough ;)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7987606611891782827-1234893115772183455?l=searching-for-ai.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://searching-for-ai.blogspot.com/feeds/1234893115772183455/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=7987606611891782827&amp;postID=1234893115772183455' title='15 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/1234893115772183455'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/1234893115772183455'/><link rel='alternate' type='text/html' href='http://searching-for-ai.blogspot.com/2008/10/meet-elbot.html' title='Meet Elbot'/><author><name>Jonathan Siddharth</name><uri>http://www.blogger.com/profile/07788292704272921873</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='09248598884304014382'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>15</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7987606611891782827.post-5116652297237074569</id><published>2008-07-06T21:26:00.000-07:00</published><updated>2008-12-10T21:53:54.209-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='facebook'/><title type='text'>Facebook Lexicon</title><content type='html'>&lt;a href="http://3.bp.blogspot.com/_zX7ecYghAHA/SHGeJZVy7QI/AAAAAAAAC4w/bMPV178zSWM/s1600-h/facebook.gif"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;" src="http://3.bp.blogspot.com/_zX7ecYghAHA/SHGeJZVy7QI/AAAAAAAAC4w/bMPV178zSWM/s200/facebook.gif" border="0" alt=""id="BLOGGER_PHOTO_ID_5220127327357431042" /&gt;&lt;/a&gt;&lt;br /&gt;I discovered the Facebook Lexicon very recently. It basically counts up words and&lt;br /&gt;phrases over the walls (like orkut's scrap book) of all the facebook users. It then plots the frequency of these words over time on a graph. &lt;br /&gt;I found the following hilarious,  "party tonight, hangover" which compares the&lt;br /&gt;frequencies of both phrases.&lt;br /&gt;&lt;a href="http://1.bp.blogspot.com/_zX7ecYghAHA/SHGdo3gib8I/AAAAAAAAC4g/gjpvV32_DZo/s1600-h/party-hang.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;" src="http://1.bp.blogspot.com/_zX7ecYghAHA/SHGdo3gib8I/AAAAAAAAC4g/gjpvV32_DZo/s400/party-hang.png" border="0" alt=""id="BLOGGER_PHOTO_ID_5220126768519868354" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Notice the near perfect phase shift ;)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7987606611891782827-5116652297237074569?l=searching-for-ai.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://searching-for-ai.blogspot.com/feeds/5116652297237074569/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=7987606611891782827&amp;postID=5116652297237074569' title='11 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/5116652297237074569'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/5116652297237074569'/><link rel='alternate' type='text/html' href='http://searching-for-ai.blogspot.com/2008/07/facebook-lexicon.html' title='Facebook Lexicon'/><author><name>Jonathan Siddharth</name><uri>http://www.blogger.com/profile/07788292704272921873</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='09248598884304014382'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_zX7ecYghAHA/SHGeJZVy7QI/AAAAAAAAC4w/bMPV178zSWM/s72-c/facebook.gif' height='72' width='72'/><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>11</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7987606611891782827.post-879298309248854960</id><published>2008-04-27T00:54:00.001-07:00</published><updated>2008-12-10T21:53:54.863-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Social Networks'/><category scheme='http://www.blogger.com/atom/ns#' term='startups'/><category scheme='http://www.blogger.com/atom/ns#' term='IM'/><title type='text'>Social Networks, the death knell for Stand-alone IM?</title><content type='html'>&lt;a href="http://3.bp.blogspot.com/_zX7ecYghAHA/SBQ6NWcvd4I/AAAAAAAAC0g/2MrSH_QnYAg/s1600-h/facebook.gif"&gt;&lt;img style="float:left; margin:0 10px 10px 0;cursor:pointer; cursor:hand;" src="http://3.bp.blogspot.com/_zX7ecYghAHA/SBQ6NWcvd4I/AAAAAAAAC0g/2MrSH_QnYAg/s200/facebook.gif" border="0" alt=""id="BLOGGER_PHOTO_ID_5193840271303866242" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;I always suspected that with the advent of Social Networks like Facebook, orkut, Myspace etc. users were using Instant Messaging (IM) less and less. I recall in my high school years instant messaging was the in-thing. Especially for the 13-25 demographic instant messaging was extremely popular. Now it seems that Social&lt;br /&gt;Networks have taken over which seems like an interesting throw-back to the asynchronous nature of communication like email.&lt;br /&gt;&lt;a href="http://4.bp.blogspot.com/_zX7ecYghAHA/SBQ6Nmcvd5I/AAAAAAAAC0o/Tc1h0y-dwFA/s1600-h/orkut.bmp"&gt;&lt;img style="float:left; margin:0 10px 10px 0;cursor:pointer; cursor:hand;" src="http://4.bp.blogspot.com/_zX7ecYghAHA/SBQ6Nmcvd5I/AAAAAAAAC0o/Tc1h0y-dwFA/s200/orkut.bmp" border="0" alt=""id="BLOGGER_PHOTO_ID_5193840275598833554" /&gt;&lt;/a&gt;&lt;br /&gt; &lt;br /&gt;Now, with Facebook's Chat Application and Google integrating Gtalk gently into Orkut, I suspect companies whose core business is IM, like Meebo will feel the pinch. Instant Messaging is still useful but might now be just a feature on a Social Network. Maybe not all demographics but the 13-30 year old web user (which is the demographic on a typical social network) will definitely not be too keen on maintaining a different friends list for Meebo (or other Web based IM clients) and one for Facebook. Sure, Meebo might develop a chat client geared for facebook like social.im but I would think that it will be hard to compete with Facebook on Facebook's turf. And a tug of war with Facebook for user visits seems like a scary prospect.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://1.bp.blogspot.com/_zX7ecYghAHA/SBQ6N2cvd6I/AAAAAAAAC0w/JbZ14FHJTb4/s1600-h/meebo.bmp"&gt;&lt;img style="float:left; margin:0 10px 10px 0;cursor:pointer; cursor:hand;" src="http://1.bp.blogspot.com/_zX7ecYghAHA/SBQ6N2cvd6I/AAAAAAAAC0w/JbZ14FHJTb4/s200/meebo.bmp" border="0" alt=""id="BLOGGER_PHOTO_ID_5193840279893800866" /&gt;&lt;/a&gt;&lt;br /&gt;I am sure Meebo, Social.im and others have thought long and hard about this. I am curious to see how these companies evolve and adapt to the changing landscape.I had an opportunity to talk with &lt;a href="http://blog.meebo.com/?page_id=2"&gt;Seth Sternberg&lt;/a&gt; (CEO, Meebo) whom I had invited to talk at the Stanford &lt;a href="http://idk2ceo.stanford.edu/index.htm"&gt;I don't know to CEO&lt;/a&gt; Business Conference. I learnt that Meebo does other things as well, like Meebo rooms which plug in nicely to Myspace, so they definitely have other tricks up their sleeve and I am sure we might see more in the months to come.&lt;br /&gt;&lt;br /&gt;As an aside, on the topic of time spent by users online on Social Networks, I heard &lt;a href="http://en.wikipedia.org/wiki/Max_Levchin"&gt;Max Levchin&lt;/a&gt; (Founder and CEO, Slide &amp; PayPal) remark that most of this time comes from the bucket that users spend on email and other entertainment on TV. I would add stand-alone IM clients to that list.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7987606611891782827-879298309248854960?l=searching-for-ai.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://searching-for-ai.blogspot.com/feeds/879298309248854960/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=7987606611891782827&amp;postID=879298309248854960' title='11 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/879298309248854960'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/879298309248854960'/><link rel='alternate' type='text/html' href='http://searching-for-ai.blogspot.com/2008/04/social-networks-death-knell-for-stand.html' title='Social Networks, the death knell for Stand-alone IM?'/><author><name>Jonathan Siddharth</name><uri>http://www.blogger.com/profile/07788292704272921873</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='09248598884304014382'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_zX7ecYghAHA/SBQ6NWcvd4I/AAAAAAAAC0g/2MrSH_QnYAg/s72-c/facebook.gif' height='72' width='72'/><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>11</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7987606611891782827.post-9037096092844498311</id><published>2008-04-06T17:00:00.000-07:00</published><updated>2008-12-10T21:53:55.086-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Selection Bias'/><category scheme='http://www.blogger.com/atom/ns#' term='Probability Theory'/><title type='text'>Beauty &amp; Intelligence</title><content type='html'>&lt;a href="http://2.bp.blogspot.com/_zX7ecYghAHA/R_luEeLJ18I/AAAAAAAACl8/dYiBiYPfK5E/s1600-h/beautint.jpg"&gt;&lt;img style="float:left; margin:0 10px 10px 0;cursor:pointer; cursor:hand;" src="http://2.bp.blogspot.com/_zX7ecYghAHA/R_luEeLJ18I/AAAAAAAACl8/dYiBiYPfK5E/s320/beautint.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5186297468992214978" /&gt;&lt;/a&gt;&lt;br /&gt;Here is a fun probability result which has to do with conditional probability. Its not very hard to see what's going on but its a fun result nonetheless. I just made up a dummy example to make it more fun :)&lt;br /&gt;&lt;br /&gt;Lets take 2 fairly independent attributes like beauty and intelligence (It can be argued that they are not independent by appealing to genetics and preferential/unequal selection rights for species perpetuation but lets ignore that for now and think simple). Lets assume beauty and intelligence are independent. &lt;br /&gt;i.e if &lt;br /&gt;B=beautiful&lt;br /&gt;I=intelligence&lt;br /&gt;&lt;br /&gt;P(I)=P(I|B) -&gt;(1)&lt;br /&gt;&lt;br /&gt;Now, think of all the people who stick in your memory (as opposed to you forgetting them after a few days of meeting them). Lets assume (simplistically)&lt;br /&gt;that the people who stick in your memory are ones who are either intelligent or beautiful.&lt;br /&gt;&lt;br /&gt;Lets throw in some numbers.&lt;br /&gt;Lets assume the prior probability of someone being beautiful is&lt;br /&gt;0.4&lt;br /&gt;and for someone being intelligent is 0.1&lt;br /&gt;Lets assume 10% of the beautiful people are intelligent.&lt;br /&gt;&lt;br /&gt;Say, you've met 200 individuals in your lifetime.&lt;br /&gt;&lt;br /&gt;If we go with the assumption of you being able to recollect only people who&lt;br /&gt;are intelligent or beautiful, you will remember&lt;br /&gt;80+12=92 individuals&lt;br /&gt;&lt;br /&gt;Now when you look at this sample set, lo and behold, it looks like intelligence&lt;br /&gt;and beauty are negatively correlated!&lt;br /&gt;&lt;br /&gt;This looks counterintuitive since it looks like,&lt;br /&gt;P(I)=0.21&lt;br /&gt;but P(I|B)=0.1&lt;br /&gt;(thus violating the independence assumption in (1))&lt;br /&gt;&lt;br /&gt;whereas in reality, the conditional probability eqns are,&lt;br /&gt;P(I|B,P) &amp;lt; P(I|P)&lt;br /&gt;where P=I U B&lt;br /&gt;which is another case of selection bias at work..&lt;br /&gt;&lt;br /&gt;Hmmm..I wonder how many people feel this way about beauty and intelligence ;)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7987606611891782827-9037096092844498311?l=searching-for-ai.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://searching-for-ai.blogspot.com/feeds/9037096092844498311/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=7987606611891782827&amp;postID=9037096092844498311' title='12 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/9037096092844498311'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/9037096092844498311'/><link rel='alternate' type='text/html' href='http://searching-for-ai.blogspot.com/2008/04/beauty-intelligence.html' title='Beauty &amp; Intelligence'/><author><name>Jonathan Siddharth</name><uri>http://www.blogger.com/profile/07788292704272921873</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='09248598884304014382'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_zX7ecYghAHA/R_luEeLJ18I/AAAAAAAACl8/dYiBiYPfK5E/s72-c/beautint.jpg' height='72' width='72'/><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>12</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7987606611891782827.post-4538584619704632341</id><published>2008-02-18T01:14:00.000-08:00</published><updated>2008-12-10T21:53:55.552-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='siddharth jonathan'/><category scheme='http://www.blogger.com/atom/ns#' term='infoaxe'/><category scheme='http://www.blogger.com/atom/ns#' term='search engine'/><title type='text'>Infoaxe - Stealth Search Engine out of Stanford</title><content type='html'>&lt;a href="http://4.bp.blogspot.com/_zX7ecYghAHA/SBq_bGcvd-I/AAAAAAAAC10/TZqy7DCGOFs/s1600-h/infoaxe_logo_black.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;" src="http://4.bp.blogspot.com/_zX7ecYghAHA/SBq_bGcvd-I/AAAAAAAAC10/TZqy7DCGOFs/s400/infoaxe_logo_black.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5195675592433825762" /&gt;&lt;/a&gt;&lt;br /&gt;I have some BIG news in this post. I have just started my company, &lt;a href="http://infoaxe.com"&gt;Infoaxe&lt;/a&gt; along with my long time friend and classmate from Stanford, &lt;a href="http://infolab.stanford.edu/~kvijay/"&gt;Vijay Krishnan&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Infoaxe is the next generation search engine searching a very different kind of Web.&lt;br /&gt;We  are in stealth mode currently and hence my lips are sealed. But stay tuned for updates. Infoaxe is short for 'Information Access' with some liberties taken :)&lt;br /&gt;It could also be a reference to the &lt;a href="http://en.wikipedia.org/wiki/The_Stanford_Axe"&gt;Stanford Axe&lt;/a&gt; ;).&lt;br /&gt;&lt;br /&gt;We developed the Infoaxe Search Engine while we were graduate students at Stanford.&lt;br /&gt;We are very excited to have &lt;a href="http://infolab.stanford.edu/people/hector.html"&gt;Prof. Hector Garcia Molina&lt;/a&gt; on our technical advisory board.&lt;br /&gt;&lt;br /&gt;We are really excited about Infoaxe which has at its core many innovations in applications of data mining and machine learning.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7987606611891782827-4538584619704632341?l=searching-for-ai.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://searching-for-ai.blogspot.com/feeds/4538584619704632341/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=7987606611891782827&amp;postID=4538584619704632341' title='17 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/4538584619704632341'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/4538584619704632341'/><link rel='alternate' type='text/html' href='http://searching-for-ai.blogspot.com/2008/02/infoaxe-stealth-search-engine-out-of.html' title='Infoaxe - Stealth Search Engine out of Stanford'/><author><name>Jonathan Siddharth</name><uri>http://www.blogger.com/profile/07788292704272921873</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='09248598884304014382'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_zX7ecYghAHA/SBq_bGcvd-I/AAAAAAAAC10/TZqy7DCGOFs/s72-c/infoaxe_logo_black.jpg' height='72' width='72'/><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>17</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7987606611891782827.post-6726621868022282369</id><published>2007-12-27T23:12:00.000-08:00</published><updated>2008-12-10T21:53:55.754-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Roulette'/><category scheme='http://www.blogger.com/atom/ns#' term='Probability Theory'/><title type='text'>Roll with the punches..tomorrow is another day..</title><content type='html'>&lt;a href="http://3.bp.blogspot.com/_zX7ecYghAHA/R3Sk2QRHXmI/AAAAAAAACeg/DugDIMaqIVs/s1600-h/roulette-wheel.jpg"&gt;&lt;img style="float:left; margin:0 10px 10px 0;cursor:pointer; cursor:hand;" src="http://3.bp.blogspot.com/_zX7ecYghAHA/R3Sk2QRHXmI/AAAAAAAACeg/DugDIMaqIVs/s320/roulette-wheel.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5148921525977964130" /&gt;&lt;/a&gt;&lt;br /&gt;The title of today's post has only a vague connection with the substance of today's post. Today's post is about a betting strategy for roulette. This strategy has an intrinsic flaw and I am going to leave it to my readers to ponder as to what the flaw is. I'll explain the flaw in a later post. &lt;br /&gt;In roulette, the casino houses typically offer odds such that the expected return is negative (which is how they stay in business). This can be shown as a direct consequence of the law of large numbers.&lt;br /&gt;The betting strategy is remarkably simple. If you lose a round, you double your bet. This is also called 'doubling up' or the Martingale strategy. The intuition is that it is very unlikely that you will have a string of losses and you hope to cover your losses when you eventually win. You have to win sometime right? And at that time you would have recovered all your losses.&lt;br /&gt;Do you see the problem with this strategy?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7987606611891782827-6726621868022282369?l=searching-for-ai.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://searching-for-ai.blogspot.com/feeds/6726621868022282369/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=7987606611891782827&amp;postID=6726621868022282369' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/6726621868022282369'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/6726621868022282369'/><link rel='alternate' type='text/html' href='http://searching-for-ai.blogspot.com/2007/12/roll-with-punchestomorrow-is-another.html' title='Roll with the punches..tomorrow is another day..'/><author><name>Jonathan Siddharth</name><uri>http://www.blogger.com/profile/07788292704272921873</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='09248598884304014382'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_zX7ecYghAHA/R3Sk2QRHXmI/AAAAAAAACeg/DugDIMaqIVs/s72-c/roulette-wheel.jpg' height='72' width='72'/><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7987606611891782827.post-6278416577129912129</id><published>2007-12-19T18:32:00.000-08:00</published><updated>2007-12-19T19:07:43.120-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Math'/><category scheme='http://www.blogger.com/atom/ns#' term='Puzzle'/><category scheme='http://www.blogger.com/atom/ns#' term='Probability Theory'/><title type='text'>The Monty Hall Puzzle</title><content type='html'>Here's a fun probability puzzle. This is called the Monty Hall Puzzle since its based off of an American Game Show, &lt;em&gt;Let's make a deal&lt;/em&gt;.&lt;br /&gt;&lt;br /&gt;Say you're in a game show where the host invites you to open one of 3 doors. One of the doors has a prize(a car) and the other 2 doors have goats. The host knows which door has the prize. After you pick a door, the host then opens one of the remaining 2doors which do not have the prize and reveals the goat. Now, he offers you the option of switching your choice from your current pick to the other remaining door.&lt;br /&gt;&lt;br /&gt;The question is, what should you do?&lt;br /&gt;A. Doesn't matter if you switch, the probability of winning is still the same&lt;br /&gt;B. Always switch&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7987606611891782827-6278416577129912129?l=searching-for-ai.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://searching-for-ai.blogspot.com/feeds/6278416577129912129/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=7987606611891782827&amp;postID=6278416577129912129' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/6278416577129912129'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/6278416577129912129'/><link rel='alternate' type='text/html' href='http://searching-for-ai.blogspot.com/2007/12/monty-hall-puzzle.html' title='The Monty Hall Puzzle'/><author><name>Jonathan Siddharth</name><uri>http://www.blogger.com/profile/07788292704272921873</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='09248598884304014382'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7987606611891782827.post-2398927945006054411</id><published>2007-12-18T22:46:00.000-08:00</published><updated>2008-12-10T21:53:57.156-08:00</updated><title type='text'>The Challenge of Duplicates on the Web</title><content type='html'>&lt;a href="http://3.bp.blogspot.com/_zX7ecYghAHA/R2jHZQRHXdI/AAAAAAAACdQ/ipj1UA3LDDs/s1600-h/google-yahoo-thumb.jpg"&gt;&lt;img style="float:right; margin:0 0 10px 10px;cursor:pointer; cursor:hand;" src="http://3.bp.blogspot.com/_zX7ecYghAHA/R2jHZQRHXdI/AAAAAAAACdQ/ipj1UA3LDDs/s320/google-yahoo-thumb.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5145581810948201938" /&gt;&lt;/a&gt;&lt;br /&gt;A few weeks back, &lt;a href="http://chadwa.wordpress.com/"&gt;Chad Walters&lt;/a&gt;, the Search Architect at &lt;a href="http://powerset.com"&gt;Powerset&lt;/a&gt; told me an interesting anecdote from the time he headed the Runtime efforts at &lt;a href="http://yahoo.com"&gt;Yahoo!&lt;/a&gt;.&lt;br /&gt;There is a fundamental problem in estimating the number of hits for search queries because of duplicates in Web Search results. Near Duplicates &amp; Duplicates occur because of many reasons like mirroring, RSS feeds, track backs etc. These add a fair amount of noise to the estimates of Web Search Engines about the size of their indexes which is always a matter of some pride to the big guys (&lt;a href="http://google.com"&gt;Google&lt;/a&gt;, &lt;a href="http://search.yahoo.com"&gt;Yahoo!&lt;/a&gt; &amp; &lt;a href="http://live.com"&gt;Microsoft&lt;/a&gt;) and rightly so.&lt;br /&gt;Anyway, a french researcher, Jean Véroni did some analysis on the number of hits reported by search engines and writes an interesting article &lt;a href="http://aixtal.blogspot.com/2006/07/search-crazy-duplicates-1.html"&gt;here&lt;/a&gt;.&lt;br /&gt;Here are some charts from his blog (http://aixtal.blogspot.com/2006/07/search-crazy-duplicates-1.html),&lt;br /&gt;&lt;br /&gt;&lt;a href="http://1.bp.blogspot.com/_zX7ecYghAHA/R2rH6QRHXfI/AAAAAAAACdg/9BLpIZ-2F0c/s1600-h/google-dups.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;" src="http://1.bp.blogspot.com/_zX7ecYghAHA/R2rH6QRHXfI/AAAAAAAACdg/9BLpIZ-2F0c/s320/google-dups.png" border="0" alt=""id="BLOGGER_PHOTO_ID_5146145327837306354" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;p align=center&gt; Google with similar pages &lt;/p&gt;&lt;br /&gt;&lt;br&gt;&lt;br /&gt;&lt;a href="http://4.bp.blogspot.com/_zX7ecYghAHA/R2rHxARHXeI/AAAAAAAACdY/DSYXQ5IJNHs/s1600-h/gg-segolisme2.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;" src="http://4.bp.blogspot.com/_zX7ecYghAHA/R2rHxARHXeI/AAAAAAAACdY/DSYXQ5IJNHs/s320/gg-segolisme2.png" border="0" alt=""id="BLOGGER_PHOTO_ID_5146145168923516386" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;p align=center&gt; Google without similar pages &lt;/p&gt;&lt;br /&gt;&lt;br&gt;&lt;br /&gt;&lt;br /&gt;&lt;a href="http://3.bp.blogspot.com/_zX7ecYghAHA/R2rIPwRHXgI/AAAAAAAACdo/Dl-0qFpLXYs/s1600-h/yahoo-dups1.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;" src="http://3.bp.blogspot.com/_zX7ecYghAHA/R2rIPwRHXgI/AAAAAAAACdo/Dl-0qFpLXYs/s320/yahoo-dups1.png" border="0" alt=""id="BLOGGER_PHOTO_ID_5146145697204493826" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;p align=center&gt; Yahoo! with similar pages &lt;/p&gt;&lt;br /&gt;&lt;br&gt;&lt;br /&gt;&lt;br /&gt;&lt;a href="http://4.bp.blogspot.com/_zX7ecYghAHA/R2rIUARHXhI/AAAAAAAACdw/-8zaXozd1Hs/s1600-h/yy-segolisme2.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;" src="http://4.bp.blogspot.com/_zX7ecYghAHA/R2rIUARHXhI/AAAAAAAACdw/-8zaXozd1Hs/s320/yy-segolisme2.png" border="0" alt=""id="BLOGGER_PHOTO_ID_5146145770218937874" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;p align=center&gt; Yahoo! without similar pages &lt;/p&gt;&lt;br /&gt;&lt;br&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;The fluctuation in the estimates is apparent when similar pages are included.&lt;br /&gt;&lt;br /&gt;To quote Jean Véronis from his blog, &lt;br /&gt;&lt;em&gt;&lt;br /&gt;It’s interesting to note that:&lt;br /&gt;    * once duplicates are removed, Google and Yahoo’s figures are about the same;&lt;br /&gt;    * Yahoo’s curves are much more stable than Google’s.&lt;/em&gt;&lt;br /&gt;&lt;br /&gt;I found this interesting in the context of the battle for supremacy in Web Search.&lt;br /&gt;Maybe Yahoo! does a better job than its given credit (or market share) for :)&lt;br /&gt;&lt;br /&gt;I did some work on near duplicate detection on the Web as part of my graduate research while at Stanford with &lt;a href="http://infolab.stanford.edu/~paepcke/"&gt;Andreas Paepcke&lt;/a&gt; of the &lt;a href="http://infolab.stanford.edu"&gt;Stanford InfoLab&lt;/a&gt; and that's one reason I found this interesting. My work can be found &lt;a href="http://infolab.stanford.edu/~jonsid/spotsigs.pdf"&gt;here&lt;/a&gt;.&lt;br /&gt;(Thanks to Chad for this interesting tidbit)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7987606611891782827-2398927945006054411?l=searching-for-ai.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://searching-for-ai.blogspot.com/feeds/2398927945006054411/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=7987606611891782827&amp;postID=2398927945006054411' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/2398927945006054411'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/2398927945006054411'/><link rel='alternate' type='text/html' href='http://searching-for-ai.blogspot.com/2007/12/challenge-of-duplicates-on-web.html' title='The Challenge of Duplicates on the Web'/><author><name>Jonathan Siddharth</name><uri>http://www.blogger.com/profile/07788292704272921873</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='09248598884304014382'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_zX7ecYghAHA/R2jHZQRHXdI/AAAAAAAACdQ/ipj1UA3LDDs/s72-c/google-yahoo-thumb.jpg' height='72' width='72'/><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7987606611891782827.post-2916567440401432878</id><published>2007-12-15T01:59:00.000-08:00</published><updated>2008-12-10T21:53:58.099-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Powerset'/><category scheme='http://www.blogger.com/atom/ns#' term='Wikipedia'/><category scheme='http://www.blogger.com/atom/ns#' term='Web Search'/><title type='text'>What has Search done to the Web?</title><content type='html'>&lt;a href="http://1.bp.blogspot.com/_zX7ecYghAHA/R2OsaARHW7I/AAAAAAAACXU/uJOXT_B_9_s/s1600-h/google-y.gif"&gt;&lt;img style="float:left; margin:0 10px 10px 0;cursor:pointer; cursor:hand;" src="http://1.bp.blogspot.com/_zX7ecYghAHA/R2OsaARHW7I/AAAAAAAACXU/uJOXT_B_9_s/s320/google-y.gif" border="0" alt=""id="BLOGGER_PHOTO_ID_5144144762135600050" /&gt;&lt;/a&gt;&lt;br /&gt;I came across this &lt;a href="http://www.youtube.com/watch?v=pWZElxi0pWU"&gt;interesting video&lt;/a&gt; from 1996 where Marc Andreessen is interviewed about Netscape and the future of the Web in general.&lt;br /&gt;Its fascinating to hear Marc's vision of the Internet and the inevitable question of how Netscape sees Microsoft. Ok, how does this relate to Web Search? Let me tell you.&lt;br /&gt;When Marc is asked about the impact of the browser on the internet, he explains it as follows. (to paraphrase) The browser basically made it easier for more people to view the Web. Only as more people started viewing the Web, did it make sense for more people to create Web Pages. He adds a nice analogy here. Its just like we wouldn't have books if there weren't any readers.&lt;br /&gt;I see Web Search as being fundamentally similar to this. The exposure and access to information that Web Search Engines like &lt;a href="http://google.com"&gt;Google&lt;/a&gt; gave to the Web has definitely fueled the growth of the WWW. Although the impact of Web search on the &lt;em&gt;quantity&lt;/em&gt; of Webpages is fairly clear, I am not too sure what the impact on &lt;em&gt;quality&lt;/em&gt; is though. Then again, quality is too subjective an attribute for the most part.&lt;br /&gt;&lt;a href="http://3.bp.blogspot.com/_zX7ecYghAHA/R2OsggRHW8I/AAAAAAAACXc/-Q_voh4aCqA/s1600-h/powerset.jpg"&gt;&lt;img style="float:right; margin:0 0 10px 10px;cursor:pointer; cursor:hand;" src="http://3.bp.blogspot.com/_zX7ecYghAHA/R2OsggRHW8I/AAAAAAAACXc/-Q_voh4aCqA/s320/powerset.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5144144873804749762" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a href="http://powerset.com"&gt;Powerset&lt;/a&gt; (where I currently work) is building a Natural Language Search Engine. It is starting out with a &lt;a href="https://labs.powerset.com/login"&gt;Search Engine for Wikipedia&lt;/a&gt;, and then will move to the WWW. On the same note, I think Powerset has the potential to improve the quality of &lt;a href="http://en.wikipedia.org"&gt;Wikipedia&lt;/a&gt;, by offering a better search for it (enabling more people to find what they want, edit what they want etc). &lt;br /&gt;Google and other Web Search Engines have contributed a fair bit to Wikipedia's growth by showing Wikipedia results in the top search results for a lot of queries.&lt;br /&gt;&lt;a href="http://battellemedia.com/archives/004115.php"&gt;John Battelle's blog post&lt;/a&gt; cites Google and Yahoo! as showing Wikipedia results in 27% and 31% of search queries respectively.&lt;br /&gt;Its going to be fun to see how Google's &lt;a href="http://www.news.com/8301-10784_3-9834175-7.html?tag=nefd.top"&gt;Knol&lt;/a&gt; plays into all of this..&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7987606611891782827-2916567440401432878?l=searching-for-ai.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://searching-for-ai.blogspot.com/feeds/2916567440401432878/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=7987606611891782827&amp;postID=2916567440401432878' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/2916567440401432878'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/2916567440401432878'/><link rel='alternate' type='text/html' href='http://searching-for-ai.blogspot.com/2007/12/what-has-search-done-to-web.html' title='What has Search done to the Web?'/><author><name>Jonathan Siddharth</name><uri>http://www.blogger.com/profile/07788292704272921873</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='09248598884304014382'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_zX7ecYghAHA/R2OsaARHW7I/AAAAAAAACXU/uJOXT_B_9_s/s72-c/google-y.gif' height='72' width='72'/><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7987606611891782827.post-1079326047890962118</id><published>2007-12-04T23:41:00.000-08:00</published><updated>2007-12-04T23:46:48.915-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='bias'/><category scheme='http://www.blogger.com/atom/ns#' term='Wikipedia'/><title type='text'>Wikipedia Bias</title><content type='html'>I guess a lot has been said about the nerd bias in Wikipedia.&lt;br /&gt;I found this hilarious website &lt;a href="http://www.wikigroaning.com "&gt;http://www.wikigroaning.com&lt;/a&gt; that tries to quantify the bias (I have no idea how they get their numbers although some quick checks indicate that it is not completely random).&lt;br /&gt;My favorite was world vs world of warcraft. :)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7987606611891782827-1079326047890962118?l=searching-for-ai.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://searching-for-ai.blogspot.com/feeds/1079326047890962118/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=7987606611891782827&amp;postID=1079326047890962118' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/1079326047890962118'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/1079326047890962118'/><link rel='alternate' type='text/html' href='http://searching-for-ai.blogspot.com/2007/12/wikipedia-bias.html' title='Wikipedia Bias'/><author><name>Jonathan Siddharth</name><uri>http://www.blogger.com/profile/07788292704272921873</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='09248598884304014382'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7987606611891782827.post-3930757811661060502</id><published>2007-12-04T23:05:00.000-08:00</published><updated>2008-12-10T21:53:58.652-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Stanford'/><category scheme='http://www.blogger.com/atom/ns#' term='IEEE'/><category scheme='http://www.blogger.com/atom/ns#' term='Powerset'/><category scheme='http://www.blogger.com/atom/ns#' term='Google'/><category scheme='http://www.blogger.com/atom/ns#' term='Marissa Mayer'/><title type='text'>Google at Stanford</title><content type='html'>&lt;a href="http://1.bp.blogspot.com/_zX7ecYghAHA/R1ZQmfr3-6I/AAAAAAAACT0/KM22_YRjsSM/s1600-h/marissa.JPG"&gt;&lt;img style="float:left; margin:0 10px 10px 0;cursor:pointer; cursor:hand;" src="http://1.bp.blogspot.com/_zX7ecYghAHA/R1ZQmfr3-6I/AAAAAAAACT0/KM22_YRjsSM/s320/marissa.JPG" border="0" alt=""id="BLOGGER_PHOTO_ID_5140384646960774050" /&gt;&lt;/a&gt;&lt;br /&gt;This blog is primarily meant to be at the crossroads of A.I and Web Search. I guess the previous posts were more or less on the theory of probabilistic models. This time we digress a bit for some Stanford news.&lt;br /&gt;&lt;br /&gt;Last week, &lt;a href="http://www.google.com/corporate/execs.html#marissa"&gt;Marissa Mayer&lt;/a&gt; was over at Stanford to give a talk at the Stanford IEEE Chapter to a packed audience. Marissa is the VP of Search Products and User Experience at Google. I have always been a fan of Marissa's and it was great to meet her in  person. &lt;br /&gt;&lt;br /&gt;Her talk revolved around the early Google days and the general philosophy that Google typically follows with its product launches and strategy. She explained these with interesting real life anecdotes. She fielded plenty of questions from all sides of the park and also stuck around for more questions after the talk.&lt;br /&gt;&lt;br /&gt;Some of the points I remember are,&lt;br /&gt;   * launch early&lt;br /&gt;   * listen to the data&lt;br /&gt;   * its ok to be unconventional&lt;br /&gt;   * solve big problems&lt;br /&gt;   &lt;br /&gt;When someone in the audience asked her about her thoughts on the Semantic Web, she made the distinction between the 'semantic web' and 'understanding semantics'. The former being hand built ontologies to enforce some form of structure and the latter being understanding meaning and intent. She seemed to think the latter had promise.&lt;br /&gt;This is exactly my stand on the topic. (Incidentally, I work at &lt;a href="http://powerset.com"&gt;Powerset&lt;/a&gt; which does the latter as well)&lt;br /&gt;&lt;br /&gt;She also touched upon Google's ambitions with respect to Books, Machine Translation, Earth etc.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7987606611891782827-3930757811661060502?l=searching-for-ai.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://searching-for-ai.blogspot.com/feeds/3930757811661060502/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=7987606611891782827&amp;postID=3930757811661060502' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/3930757811661060502'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/3930757811661060502'/><link rel='alternate' type='text/html' href='http://searching-for-ai.blogspot.com/2007/12/google-at-stanford.html' title='Google at Stanford'/><author><name>Jonathan Siddharth</name><uri>http://www.blogger.com/profile/07788292704272921873</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='09248598884304014382'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_zX7ecYghAHA/R1ZQmfr3-6I/AAAAAAAACT0/KM22_YRjsSM/s72-c/marissa.JPG' height='72' width='72'/><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7987606611891782827.post-5728119850226230936</id><published>2007-11-05T22:35:00.000-08:00</published><updated>2007-11-05T23:34:59.309-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Simpson&apos;s Paradox'/><category scheme='http://www.blogger.com/atom/ns#' term='Probabilistic Graphical Models'/><category scheme='http://www.blogger.com/atom/ns#' term='Probability Theory'/><title type='text'>Simpson's Paradox explained...</title><content type='html'>The apparent paradox arises because, taking the drug is correlated with Gender.&lt;br /&gt;In the above example, say men are more likely to take the drug than women. Say 75% of men take the drug while only 25% of women take the drug.&lt;br /&gt;Let me explain it with some numbers, say women and men are equally represented in the population. (say 100 men and 100 women) Then there will be 75 men who will take the drug and 25 women who will take the drug. From the statistics mentioned earlier among those who took the drug, 52.5 men (70%) will be cured and 5 women(20%) will be cured. So 57.5% of those who took the drug are cured. By similar reasoning, 20 men who did not take the drug are cured and 30 women who did not take the drug are cured. i.e. 50% of those who did not take the drug are cured which explain the surprising result.&lt;br /&gt;&lt;br /&gt;The moral of the story is that the right probabilistic query to ask the model is not the observational query P(cure|drug) but the causal query P(cure|do(drug))&lt;br /&gt;&lt;br /&gt;It is interesting how Simpson's paradox has at times been used to explain altruism in a Darwinian setting wherein natural selection inherently disadvantages individuals who confer benefits on their competitors. The &lt;a href="http://plato.stanford.edu/entries/paradox-simpson/#Evolutionary"&gt;Stanford Encyclopedia&lt;/a&gt; explains this fairly well. The summary is that although seemingly counter-intuitive, populations generally sustain altruistic individuals and do not get run over with selfish individuals thus relieving the lazy/inefficient/incapable individuals of some evolutionary stress to survive ;)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7987606611891782827-5728119850226230936?l=searching-for-ai.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://searching-for-ai.blogspot.com/feeds/5728119850226230936/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=7987606611891782827&amp;postID=5728119850226230936' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/5728119850226230936'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/5728119850226230936'/><link rel='alternate' type='text/html' href='http://searching-for-ai.blogspot.com/2007/11/simpsons-paradox-explained.html' title='Simpson&apos;s Paradox explained...'/><author><name>Jonathan Siddharth</name><uri>http://www.blogger.com/profile/07788292704272921873</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='09248598884304014382'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7987606611891782827.post-6671746495454132138</id><published>2007-10-28T00:43:00.000-07:00</published><updated>2007-10-28T01:22:50.777-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Probabilistic Models'/><category scheme='http://www.blogger.com/atom/ns#' term='Causal Models'/><title type='text'>Simpson's Paradox</title><content type='html'>I found Simpson's paradox fascinating the first time I came across it in a course by&lt;br /&gt;&lt;a href="http://robotics.stanford.edu/%7Ekoller/"&gt;Daphne Koller&lt;/a&gt;,&lt;br /&gt;The paradox (loosely) states the following,&lt;br /&gt;Say we are trying to decide whether a drug is beneficial in curing a disease in a population where males and females are equally represented. We are given statistics that 57.5% of patients who took the drug are cured whereas only 50% of the patients who did not take the drug are cured. This seems to suggest that the drug is beneficial in general. But, we are also told that 70% of the male patients who took the drug are cured whereas 80% of males who did not take the drug are cured. Among females, 20% who took the drug are cured and 40% who did not take the drug are cured. These numbers look surprising! Right? Thus despite the apparently harmful effect of the drug on both men and women, the overall effect of the drug seems beneficial.&lt;br /&gt;&lt;br /&gt;How is this possible?&lt;br /&gt;&lt;br /&gt;I'll explain the result in my next post! Stay tuned...&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7987606611891782827-6671746495454132138?l=searching-for-ai.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://searching-for-ai.blogspot.com/feeds/6671746495454132138/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=7987606611891782827&amp;postID=6671746495454132138' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/6671746495454132138'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/6671746495454132138'/><link rel='alternate' type='text/html' href='http://searching-for-ai.blogspot.com/2007/10/simpsons-paradox.html' title='Simpson&apos;s Paradox'/><author><name>Jonathan Siddharth</name><uri>http://www.blogger.com/profile/07788292704272921873</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='09248598884304014382'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7987606611891782827.post-8991594104032516911</id><published>2007-04-04T00:57:00.000-07:00</published><updated>2007-04-12T15:07:32.932-07:00</updated><title type='text'>introduction</title><content type='html'>Hi,&lt;br /&gt;I'm Jonathan and in this blog, I hope to publish my thoughts as to where I think&lt;br /&gt;web search and information retrieval advances are headed and also to write about things that I think are cool and pushing the envelope in terms of innovative research in computer science&lt;br /&gt;although with a primary bias on the above topics.&lt;br /&gt;&lt;br /&gt;Jonathan&lt;br /&gt;&lt;a href="http://stanford.edu/~jonsid"&gt;http://stanford.edu/~jonsid&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7987606611891782827-8991594104032516911?l=searching-for-ai.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://searching-for-ai.blogspot.com/feeds/8991594104032516911/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=7987606611891782827&amp;postID=8991594104032516911' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/8991594104032516911'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7987606611891782827/posts/default/8991594104032516911'/><link rel='alternate' type='text/html' href='http://searching-for-ai.blogspot.com/2007/04/introduction.html' title='introduction'/><author><name>Jonathan Siddharth</name><uri>http://www.blogger.com/profile/07788292704272921873</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='09248598884304014382'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry></feed>