tag:blogger.com,1999:blog-262559832009-07-20T09:27:26.795+02:00Mining Drug Space"Communication is the essence of science"” (Francis Crick)Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.comBlogger149125tag:blogger.com,1999:blog-26255983.post-72822904651567278552009-06-07T19:02:00.017+02:002009-06-07T22:22:35.321+02:00Google Wave - Mission Drug Design Federation<blockquote>"<span class="Apple-style-span" style="font-style: italic;">The Google Wave Federation Protocol may succeed email (an innovation from 1965), as the dominant form of Internet communication.</span>"<br />[<a href="http://en.wikipedia.org/wiki/Google_Wave_Federation_Protocol">Google Wave Federation Protocol @WP</a>]</blockquote>Google launched <a href="http://en.wikipedia.org/wiki/Google_Wave">Google Wave</a>. It is a mind-blowing <a href="http://mashable.com/2009/05/28/google-wave-guide/">technical platform</a>, which might just "change" <a href="http://davidhburton.com/author/blog//?p=1467">publishing</a>, <a href="http://scienceroll.com/2009/06/05/google-wave-in-science-and-medicine/">health support</a> of patients and physicians, and any other industry, where knowledge workers are suffering from <a href="http://kftf.ischool.washington.edu/index.htm">keeping found things found</a> (KFTF) and collaboration security (see <a href="http://friendfeed.com/sciphu/4bf7c857/on-challenges-of-conference-blogging-genetic">conference blogging</a>, <a href="http://friendfeed.com/science-2-0/00b2beed/message-to-nature-network-users-0-replies-via">libel law</a>, <a href="http://miningdrugs.blogspot.com/2009/05/we-know-that-scientific-collaboration.html">compliant commenting</a>, <a href="http://scienceroll.com/2008/03/15/dangers-of-web-20-in-medicine/">medicine 2.0 danger</a>). I do not think the GWave will replace collaboration tools, but it might enrich them, by facilitating information moderation and bridging (for all peers and channels).<div><br /></div><div>You can check first the Google Wave presentation (almost 1.5 hours), or continue reading below.<br /><div><br /><object height="344" width="425"><param name="movie" value="http://www.youtube.com/v/v_UyVmITiYQ&amp;hl=de&amp;fs=1&amp;"><param name="allowFullScreen" value="true"><param name="allowscriptaccess" value="always"><embed src="http://www.youtube.com/v/v_UyVmITiYQ&amp;hl=de&amp;fs=1&amp;" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" height="344" width="425"></embed></object><br /><br /></div><div><span class="Apple-style-span" style="font-weight: bold;"><span class="Apple-style-span" style="font-size:x-large;">What is the negative side?</span></span></div><div>There are two critical posts about GWave (<a href="http://ff.im/3ACGz">1</a>,<a href="http://friendfeed.com/google-wave/68417159/google-wave-doesn-t-look-like-tsunami-jungleg">2</a>), and I comment on them here.</div><div><ul><li><span class="Apple-style-span" style="font-weight: bold;">like instant messaging, an attention keeper</span>. I agree, but you could say the same for phone calls, meetings, or any other some2some, or some2many communication form. So, this can be easily disarmed by providing an "I am in the work-flow or busy" button, so do not expect immediate response. Just leave me alone and do not cause a workplace <a href="http://en.wikipedia.org/wiki/Distraction">distraction</a>.</li><li><span class="Apple-style-span" style="font-weight: bold;">instant typing and sentence rewriting is creepy</span>. I agree, it is like sitting in front of a washing machine, not really educational. There should be some degree of federation cache a user can trigger. Sometimes I might want to write even a full paragraph before getting viewed by others.</li><li><span class="Apple-style-span" style="font-weight: bold;">CCing wrong person.</span> This is a security leakage problem, and can happen with any electronic information. It is just a gut feeling, but I would assume that GWave could even trigger a recall or hold-back at any later stage. So, it might be even safer then the actual eMail system? Anyway, I still think that all confidential data should come with a <a href="http://oneswarm.cs.washington.edu/">peer2peer privacy</a> option, or any other form of encryption/decryption.</li><li><span class="Apple-style-span" style="font-weight: bold;">another silo of communication.</span> A what? If anything can bridge silos, then communication. The demo has shown that GWave can integrate already many other data silos, and it provides even an open communication layer. Thumbs up for Google for making the standard open.</li><li><span class="Apple-style-span" style="font-weight: bold;">user-wall gardening.</span> This is serious and opens again the question how many IDs a <a href="http://nsaunders.wordpress.com/2008/01/17/researcher-id/">researcher</a> can and should have nowadays. Why do you think are people talking about global IDs, like <a href="http://en.wikipedia.org/wiki/OpenID">OpenID</a>? Exactly, to get rid of all the many single logins. So, I rather trust Google then any other small startup for hosting my identity information. Though, any company with a lot of data has to take user data, especially patient data, serious, right <a href="http://friendfeed.com/silpol/820ca0f8/deny-this-last-fm">last.fm</a>? </li><li><span class="Apple-style-span" style="font-weight: bold;">eMail threads work, so why the Wave hype.</span> Honestly, I had already many2many threads with overlapping replies, and believe me, eMail fragmentation is a problem. At some point I was ending up with copying eMail parts to a word processor for allowing me to follow the argumentation sub-trees. And no, a live meeting is not an option, if people are in different (overlapping) meetings, buildings, and countries.</li></ul></div><div><span class="Apple-style-span" style="font-weight: bold;"><span class="Apple-style-span" style="font-size:x-large;">What is in there for drug design?</span></span></div><div><span class="Apple-style-span" style="font-weight: bold;"><span class="Apple-style-span" style="font-weight: normal;">The key feature is the federation of data and people sources, something like <a href="http://en.wikipedia.org/wiki/FriendFeed">FriendFeed</a> behind your company firewall in a secure legal environment. </span></span></div><div><span class="Apple-style-span" style="font-weight: bold;"><span class="Apple-style-span" style="font-weight: normal;"><blockquote>"<span class="Apple-style-span" style="font-style: italic;">@Google, please take data security very seriously, only a few people might want that confidential data is moving to your server. We can talk about <span class="Apple-style-span" style="font-weight: bold;">contribution tokens and embedding tools</span>, but data should stay in the remote company silo.</span>" [personal note]</blockquote></span></span></div><div><ul><li><span class="Apple-style-span" style="font-weight: bold;">Business Intelligence and News Alerts.</span> You might follow a series of business intelligence news, journal article news, or other news feeds. Is it not sometimes a pity, that the commenting happens all over the place? Sure, there are <a href="http://www.backtype.com/">BackType</a> and <a href="http://disqus.com/">Disqus</a> for a comment federation, but it would be so much nicer, if everything could be federated in one stream. The Google Wave demo showed a federation example with a blog post, so I assume this will be possible.</li><li><span class="Apple-style-span" style="font-weight: bold;">Dynamic Reporting.</span> Imagine you get an updated data report for your project X. As usual, people can reply via eMail and tell you what is nice or unexpected. But, it would be so much nicer, if people could actually collaborate on the data in one workflow, instead of switching forward and backward between different data sources and communication media. This could really boost efficiency.</li><li><span class="Apple-style-span" style="font-weight: bold;">Data Silo Bridging.</span> Like many other tools is the trend going towards peer and feed federation, like on FriendFeed, Yahoo pipes, or RSS readers. This are all tools with their own API. In other words, you will face the same communication problems you have with data warehouses, either you create all NxN relationships between the services, or communication will be inefficient. Why do you think have people invented <a href="http://www.biodas.org/wiki/Main_Page">BioDAS</a> or <a href="http://www.biomart.org/">BioMART</a>? Exactly, for having only one service many others can talk to, just like Google Wave. This will reduce the number of communication and API bridging efforts dramatically.</li><li><span class="Apple-style-span" style="font-weight: bold;">Project Feedback.</span> If I have a question within a project where the project members are not sitting in my room, my building, my campus, or even my country. What can I do? I write an eMail, and expect sometimes tree-like eMails threads, which are overlapping. If you like playing eMail puzzle, this might be fun, it is certainly not when you are having a busy work day. Then eMail fragmentation is a distraction. GWave looks promising and might just avoid this problem, totally.</li></ul></div><div>Finally, I think Google Wave can certainly reduce the information fragmentation and data silo bridging problem with its <a href="http://en.wikipedia.org/wiki/Google_Wave_Federation_Protocol">new and open federation protocol</a>.<br />I hope it might find its way into businesses with strict communication regulations, eg. Pharma, Medicine, or any other organization dealing with patient data. I further hope that Google takes <a href="http://miningdrugs.blogspot.com/2009/05/we-know-that-scientific-collaboration.html">data security</a> and <a href="http://en.wikipedia.org/wiki/Data_deduplication">data duplication</a> issues very seriously in their federation protocol. Google might have here an ability to tackle both things, I am looking forward to it.</div></div><div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/26255983-7282290465156727855?l=miningdrugs.blogspot.com'/></div>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.com4tag:blogger.com,1999:blog-26255983.post-28425566217719007282009-05-30T00:09:00.010+02:002009-05-30T00:47:19.142+02:00Drug popularity (via Google queries) - Yet Another Long Tail (YALT)Here some drug popularity trends based on automatic Google queries. The drug names were taken from <a href="http://www.drugbank.ca/">DrugBank</a>.<script type="text/javascript" src="http://manyeyes.alphaworks.ibm.com/manyeyes/visualizations/23abd28a4c9d11debe2e000255111976/comments/23b04ed24c9d11debe2e000255111976.js?width=400&amp;height=350"></script><script type="text/javascript" src="http://manyeyes.alphaworks.ibm.com/manyeyes/visualizations/e4adb3664c9a11dea898000255111976/comments/e4b947444c9a11dea898000255111976.js?width=400&amp;height=350"></script><br /><div><span style="font-weight:bold;">The top ten are</span><br /><ol><li>Cholesterol<br></br><script type="text/javascript" src="http://www.chemspider.com/csjsapi.ashx?op=img&amp;tk=a1ef575b-ce61-449f-89b6-b3c88e2dc295&amp;id=5775&amp;w=150&amp;eid=Drug+popularity+2009-05+(top+10)"></script><br /></li><li>Aspirin<br></br><script type="text/javascript" src="http://www.chemspider.com/csjsapi.ashx?op=img&amp;tk=a1ef575b-ce61-449f-89b6-b3c88e2dc295&amp;id=2157&amp;w=150&amp;eid=Drug+popularity+2009-05+(top+10)"></script><br /></li><li>Ethanol<br></br><script type="text/javascript" src="http://www.chemspider.com/csjsapi.ashx?op=img&amp;tk=a1ef575b-ce61-449f-89b6-b3c88e2dc295&amp;id=682&amp;w=150&amp;eid=Drug+popularity+2009-05+(top+10)"></script><br /></li><li>Ibuprofen<br></br><script type="text/javascript" src="http://www.chemspider.com/csjsapi.ashx?op=img&amp;tk=a1ef575b-ce61-449f-89b6-b3c88e2dc295&amp;id=3544&amp;w=150&amp;eid=Drug+popularity+2009-05+(top+10)"></script><br /></li><li><a href="http://en.wikipedia.org/wiki/Insulin">Insulin</a><br></br><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_cUdwVNvWCbc/SiBjZEly7NI/AAAAAAAAAgA/i6CSLFVvRP0/s1600-h/250px-InsulinHexamer.jpg"><img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 250px; height: 170px;" src="http://1.bp.blogspot.com/_cUdwVNvWCbc/SiBjZEly7NI/AAAAAAAAAgA/i6CSLFVvRP0/s320/250px-InsulinHexamer.jpg" border="0" alt="" id="BLOGGER_PHOTO_ID_5341378440441556178" /></a><br /></li><li>Acetaminophen<br></br><script type="text/javascript" src="http://www.chemspider.com/csjsapi.ashx?op=img&amp;tk=a1ef575b-ce61-449f-89b6-b3c88e2dc295&amp;id=1906&amp;w=150&amp;eid=Drug+popularity+2009-05+(top+10)"></script><br /></li><li>Testosterone<br></br><script type="text/javascript" src="http://www.chemspider.com/csjsapi.ashx?op=img&amp;tk=a1ef575b-ce61-449f-89b6-b3c88e2dc295&amp;id=5791&amp;w=150&amp;eid=Drug+popularity+2009-05+(top+10)"></script><br /></li><li>Cocaine<br></br><script type="text/javascript" src="http://www.chemspider.com/csjsapi.ashx?op=img&amp;tk=a1ef575b-ce61-449f-89b6-b3c88e2dc295&amp;id=10194104&amp;w=150&amp;eid=Drug+popularity+2009-05+(top+10)"></script><br /></li><li>Oxycodone<br></br><script type="text/javascript" src="http://www.chemspider.com/csjsapi.ashx?op=img&amp;tk=a1ef575b-ce61-449f-89b6-b3c88e2dc295&amp;id=4447649&amp;w=150&amp;eid=Drug+popularity+2009-05+(top+10)"></script><br /></li><li>Vardenafil<br></br><script type="text/javascript" src="http://www.chemspider.com/csjsapi.ashx?op=img&amp;tk=a1ef575b-ce61-449f-89b6-b3c88e2dc295&amp;id=99300&amp;w=150&amp;eid=Drug+popularity+2009-05+(top+10)"></script></li></ol><br /><span class="Apple-style-span" style="font-weight: bold;">References</span><ul><li>I used a Python script for the Google data fetching. Let me know, if you need more details.</li><li>The Long Tail <a href="http://en.wikipedia.org/wiki/The_Long_Tail">@Wikipedia</a></li><li><span class="Apple-style-span" style="font-style: italic;">DrugBank: a knowledgebase for drugs, drug actions and drug targets. </span><br />Wishart DS, Knox C, Guo AC, Cheng D, Shrivastava S, Tzur D, Gautam B, Hassanali M.<br />Nucleic Acids Res. <span class="Apple-style-span" style="font-weight: bold;">2008, </span>36, (Database issue):D901-6.<br />PMID: <a href="http://www.ncbi.nlm.nih.gov/pubmed/18048412">18048412</a><br /></li><li><span class="Apple-style-span" style="font-style: italic;">DrugBank: a comprehensive resource for in silico drug discovery and exploration.</span><br />Wishart DS, Knox C, Guo AC, Shrivastava S, Hassanali M, Stothard P, Chang Z, Woolsey J.<br />Nucleic Acids Res. <span class="Apple-style-span" style="font-weight: bold;">2006, </span>34, (Database issue):D668-72. <br />PMID: <a href="http://www.ncbi.nlm.nih.gov/pubmed/16381955">16381955</a></li></ul></div><div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/26255983-2842556621771900728?l=miningdrugs.blogspot.com'/></div>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.com2tag:blogger.com,1999:blog-26255983.post-51057322818163762872009-05-26T22:18:00.008+02:002009-05-26T22:52:42.260+02:00Compliant commenting in science 2.0We know that <a href="http://www.nature.com/news/2009/090318/full/458274a.html">scientific collaboration</a> is important, but what is holding people back to contribute to web 2.0 communities (of people, like you!) ?<br />This <a href="http://friendfeed.com/science-2-0/9af350c1/why-people-do-not-comment-online-articles-what">question was raised on FriendFeed</a> (another web 2.0 platform, a good one). <br /><br />Though I answered parts of this <a href="http://miningdrugs.blogspot.com/2006/09/even-if-online-what-is-missing-for.html">question earlier</a>, here a focus on the increasing number of compliance regulations, especially for people in industry (e.g. drug design). Beside, please check also the <a href="http://scienceroll.com/2008/03/15/dangers-of-web-20-in-medicine/">dangers in health/medicine 2.0</a>.<br /><div style="width:425px;text-align:left" id="__ss_1491990"><a style="font:14px Helvetica,Arial,Sans-serif;display:block;margin:12px 0 3px 0;text-decoration:underline;" href="http://www.slideshare.net/joergkurtwegner/science-and-compliant-commenting?type=powerpoint" title="Science And Compliant Commenting">Science And Compliant Commenting</a><object style="margin:0px" width="425" height="355"><param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=scienceandcompliantcommenting-090526150607-phpapp02&amp;stripped_title=science-and-compliant-commenting"><param name="allowFullScreen" value="true"><param name="allowScriptAccess" value="always"><embed src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=scienceandcompliantcommenting-090526150607-phpapp02&amp;stripped_title=science-and-compliant-commenting" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"></embed></object><div style="font-size:11px;font-family:tahoma,arial;height:26px;padding-top:2px;">View more <a style="text-decoration:underline;" href="http://www.slideshare.net/">OpenOffice presentations</a> from <a style="text-decoration:underline;" href="http://www.slideshare.net/joergkurtwegner">Joerg kurt Wegner</a>.</div></div><div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/26255983-5105732281816376287?l=miningdrugs.blogspot.com'/></div>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.com0tag:blogger.com,1999:blog-26255983.post-50959450216304440622009-05-19T21:16:00.012+02:002009-05-19T22:53:57.982+02:00Peer recognition and building relationships - Even in drug design<blockquote>"<span style="font-style: italic;">Experts will want to contribute to coworkers who need them, who will hear them, who will respect them and who may even thank them. ... Rather than asking, How do we incentivise people to share their knowledge? It would be more useful to ask, How do we develop relationships across the globe that will set in motion more knowledge sharing?</span>" [slightly generalized after <a href="http://www.nancydixonblog.com/2009/03/the-incentive-question-or-why-people-share-knowledge.html">Nancy Dixon</a>]</blockquote><blockquote>"<span style="font-style: italic;">Today there seems to be a new distribution model that is emerging. One that is based on people’s ability to publically syndicate and distribute messages — aka content — in an open manner. This has been a part of the internet since day one — yet now its emerging in a different form — it's not pages, it's streams, its social and so its syndication.</span>" [<a href="http://www.businessinsider.com/henry-blodget-the-rise-of-social-distribution-networks-2009-5">John Borthwick</a>]</blockquote>Recently, there are many efforts ongoing to build relationships in the drug design community, and yes, the peers are recognizing and appreciating those efforts.<br /><ul><li>WikiChemists and Chemical Abstract Service (CAS) announced their <a href="http://en.wikipedia.org/wiki/Wikipedia:Wikipedia_Signpost/2009-05-18/Chemistry_data">collaboration</a>. Beside, we should never forget that this is a community achievement of many people, especially the <a href="http://en.wikipedia.org/wiki/Wikipedia:WikiProject_Chemistry">WikiChemists</a> and everybody involved in <a href="http://www.chemspider.com/blog/the-royal-society-of-chemistry-acquires-chemspider.html">ChemSpider (appreciated by RSC</a>). Dear CAS, we appreciate, that <a href="http://pubs.acs.org/cen/email/html/cen_87_i20_8720notw3.html">you appreciate</a>.</li><li><a href="http://www.chemspider.com/blog/welcome-chemmantis-to-chemzoo-and-a-call-for-contributions-to-the-community.html">ChemMantis</a> (<a href="http://www.slideshare.net/AntonyWilliams/crowdsourcing-collaborations-and-textmining-in-a-world-of-open-chemistry-1366224">slides</a>) and <a href="http://www.elseviergrandchallenge.com/team1.html">Reflect</a> (<a href="http://www.elseviergrandchallenge.com/webcast_reflect.html">webcast</a>) bridge chemical and biological data silos. This saves time and enriches information and is peer appreciated.<br /></li><li><a href="http://biskit.pasteur.fr/">Biskit</a> bridges tools and services manipulating and analyzing macromolecular structures, protein complexes, and molecular dynamics trajectories. If you ask me, any efforts making daily life easier and giving access to many tools via one interface is peer appreciated.</li><li><a href="http://www.citeulike.org/user/joergkurtwegner/article/4488840">BioMart</a> and <a href="http://www.citeulike.org/user/joergkurtwegner/article/3064678">DAS</a> (Distributed Annotation System) are both systems designed for accessing many biological data sources via a unified interface. This is peer appreciated.<br /></li><li>The <a href="http://www.citeulike.org/user/joergkurtwegner/article/1756058">Functional Genomics Experiment</a> data model (FuGE) has been developed to facilitate convergence of data standards for high-throughput, comprehensive analyses in biology. FuGE models the components of an experimental activity that are common across different technologies, including protocols, samples and data. The community is<a href="http://www.citeulike.org/user/joergkurtwegner/article/4540763"> using it and contributes</a> to the development of convergent data standards in omics research. Clearly, peer appreciated.<br /></li><li>When thinking about drug or side-effect re-use, a lot of information and people were involved in creating this information. So, helping patients by bringing drugs to people to drugs to people to ... is peer appreciated.<br /><div style="width: 425px; text-align: left;" id="__ss_1457222"><a style="margin: 12px 0pt 3px; font-family: Helvetica,Arial,Sans-serif; font-style: normal; font-variant: normal; font-weight: normal; font-size: 14px; line-height: normal; font-size-adjust: none; font-stretch: normal; display: block; text-decoration: underline;" href="http://www.slideshare.net/larsjuhljensen/predicting-novel-targets-for-existing-drugs-using-side-effect-information?type=powerpoint" title="Predicting novel targets for existing drugs using side effect information">Predicting novel targets for existing drugs using side effect information</a></div></li></ul><object style="margin: 0px;" height="355" width="425"><param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=jen09talk8-090519041113-phpapp01&amp;stripped_title=predicting-novel-targets-for-existing-drugs-using-side-effect-information"><param name="allowFullScreen" value="true"><param name="allowScriptAccess" value="always"><embed src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=jen09talk8-090519041113-phpapp01&amp;stripped_title=predicting-novel-targets-for-existing-drugs-using-side-effect-information" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" height="355" width="425"></embed></object><div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/26255983-5095945021630444062?l=miningdrugs.blogspot.com'/></div>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.com0tag:blogger.com,1999:blog-26255983.post-38159361860822716112009-05-11T20:44:00.001+02:002009-05-11T20:53:53.663+02:00Tag cloud - Mining Drug Space - May 2009<a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_cUdwVNvWCbc/SghzzFc1udI/AAAAAAAAASM/l5vkJpGNEfQ/s1600-h/wordle_mds_2009_05_11.gif"><img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 166px;" src="http://4.bp.blogspot.com/_cUdwVNvWCbc/SghzzFc1udI/AAAAAAAAASM/l5vkJpGNEfQ/s400/wordle_mds_2009_05_11.gif" border="0" alt="" id="BLOGGER_PHOTO_ID_5334641080094013906" /></a>As created by <a href="http://www.wordle.net/gallery/wrdl/836260/Mining_Drug_Space_2009-05">Wordle</a>.<div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/26255983-3815936186082271611?l=miningdrugs.blogspot.com'/></div>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.com0tag:blogger.com,1999:blog-26255983.post-48997540155538437512009-05-06T20:32:00.015+02:002009-05-09T17:38:15.980+02:00Drug design and thinking the unthinkable<blockquote>"<span class="Apple-style-span" style="font-style: italic;">The problem pharma industry faces isn’t that they didn’t see generics coming. They not only saw it miles off, they figured out early on that they needed a plan to deal with it. ... In drug design the unthinkable scenario unfolded something like this: The ability to share data and intellectual properties wouldn’t shrink, it would grow. Walled data silos would prove unpopular and inefficient. ... No one experiment is going to replace what we are now losing with designed drugs, but over time, the collection of new experiments that do work might give us the cure we need.</span>" [adapted from <a href="http://www.shirky.com/weblog/2009/03/newspapers-and-thinking-the-unthinkable/">Clay Shirky</a>]</blockquote><div><div>We need legal support, intellectual property protection, and social legal systems, we need them more than ever ! Sure, we want to work 'openly' together, but not without payment. I think there are little intentions of patients and scientists to starve themselves to death for providing a health benefit to others. The key question is therefore how we can measure knowledge and contributions allowing us to reward the support of patients and scientists? Is there any metric or legal system we can adapt? If you have any idea, let me know !</div><div><blockquote>"<span class="Apple-style-span" style="font-style: italic;">You can’t control what you can't measure.</span>" [<a href="http://en.wikipedia.org/wiki/Software_metric">Tom DeMarco</a>]</blockquote></div><div>What can we learn from</div><div><ul><li><a href="http://en.wikipedia.org/wiki/Lawrence_Lessig">Lawrence Lessing</a> (legal aspects of free cultures), </li><li><a href="http://andrewmcafee.org/blog/?page_id=546">Andrew McAfee</a> (Enterprise 2.0), and the</li><li><a href="http://dotank.nyls.edu/Aboutus.html">Do Tank</a> for delivering <a href="http://miningdrugs.blogspot.com/2009/03/do-drug-innovation-while-information.html">Do Drug Development (D<sup>3</sup>) solutions</a> </li></ul></div></div><br /><div style="text-align: center;"><span class="Apple-style-span" style="color: rgb(0, 0, 238); text-decoration: underline;"><br /></span></div><div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/26255983-4899754015553843751?l=miningdrugs.blogspot.com'/></div>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.com1tag:blogger.com,1999:blog-26255983.post-23336449438852035302009-03-30T19:30:00.017+02:002009-03-30T21:20:42.376+02:00Open innovation in drug design? - I do not think so!Triggered by <a href="http://friendfeed.com/e/b4adf06c-39b4-a8b3-0c06-4247ab2c2e06/Walking-the-walk-The-practical-experience-of-Web2/">slides and a discussion</a> from Cameron Neylon was I wondering, to which degree people believe in open innovation for the drug design community?<div>Though the principle seems valid for a few software companies, is there little support for life sciences on this topic.</div><blockquote>"<span class="Apple-style-span" style="font-style: italic;">It's an area that's just not been explored much. The usual flow is very much linear. Can be developed in academia, is then licensed via tech transfer.</span>" [<a href="http://friendfeed.com/e/b4adf06c-39b4-a8b3-0c06-4247ab2c2e06/Walking-the-walk-The-practical-experience-of-Web2/">Deepak</a>]</blockquote><blockquote>"<span class="Apple-style-span" style="font-style: italic;">The question is whether a business can make money by a route that doesn't involve patents (or copyright). There are examples of that, O'Reilly publishing being the most obvious but they are pretty thin.</span>" [<a href="http://friendfeed.com/e/b4adf06c-39b4-a8b3-0c06-4247ab2c2e06/Walking-the-walk-The-practical-experience-of-Web2/">Cameron</a>]</blockquote>In other words, if you do not need the money, do whatever you like openly. <div>If you need the money, e.g. for clinical trials, then please make sure staying within the same legal framework, which is most probably not open! Do not misunderstand this, I still strongly believe in (the legally encapsulated) web 2.0 !<br /></div><div><br /></div><div><span class="Apple-style-span" style="font-weight: bold;">Drug design is not like the IT industry</span></div><div><span class="Apple-style-span" style="font-weight: bold;"><br /></span></div><div>Some people think drug design is like chip or software design, e.g. by (over)simplifying the business models. I heard that before, and I still <a href="http://miningdrugs.blogspot.com/2007/11/andy-grove-when-silicon-indoctrinates.html">doubt so</a>! Chip and software design are mostly an engineering task of known facts or an optimization of measurable parameters. In contrast, drug design uses often facts or parameters from the <a href="http://dx.doi.org/10.1093/bfgp/eln033">tip of the iceberg</a>, but there are <a href="http://miningdrugs.blogspot.com/2006/10/bounded-rational-drug-design.html">much more unknowns</a> , or even things we can not look at (like <a href="http://en.wikipedia.org/wiki/Metabolic_pathway">*all* metabolic pathways</a> in humans)! Beside, the <a href="http://www.1000genomes.org/">exploding</a> amount of information is <a href="http://miningdrugs.blogspot.com/2009/03/do-drug-innovation-while-information.html">making things not easier</a>.<br /><br />On the other hand, who could blame humans for oversimplifying problems ...<br /></div><div></div><blockquote><div><span class="Apple-style-span" style="font-style: italic;">Unless we concentrate very hard, we often simplify a problem, because our minds routinely do so without knowing it.<br /></span></div><div><span class="Apple-style-span" style="font-style: italic;">Many people confuse the statement that "almost all drugs are small molecules" with "almost all small molecules are drugs". Assume that the first statement is true, that 96 percent of drugs are small molecules (not counting biotech, or nutraceutical drugs). This would mean that only about 0.01 percent of small molecules are drugs, since there are more than</span><span class="Apple-style-span" style="font-style: italic;"> 24 million </span><span class="Apple-style-span" style="font-style: italic;">physically existing </span><span class="Apple-style-span" style="font-style: italic;">small molecules and only 4600 drugs, roughly one in five thousand. So the logical mistake makes you (unconsciously) overestimate the odds of a randomly drawn small molecule being a drug by more than five thousand times !</span> [freely adapted after <a href="http://en.wikipedia.org/wiki/The_Black_Swan_%28Taleb_book%29">The Black Swan, p 52</a>]</div></blockquote><span class="Apple-style-span" style="font-weight: bold;"><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://upload.wikimedia.org/wikipedia/en/1/17/The_black_swan_taleb_cover.jpg"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 200px; height: 200px;" src="http://upload.wikimedia.org/wikipedia/en/1/17/The_black_swan_taleb_cover.jpg" alt="" border="0" /></a>References</span><br /><ul><li>4637 small molecule, 137 biotech, and nutraceutical drugs as taken from <a href="http://www.drugbank.ca/stats">DrugBank 2009-03</a>. The number of 24 millions physically existing (as published, not as existing or purchaseable) small molecules was taken from a recent <a href="http://dx.doi.org/10.1021/jo8001276">CAS publication</a>.<br /></li></ul><div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/26255983-2333644943885203530?l=miningdrugs.blogspot.com'/></div>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.com5tag:blogger.com,1999:blog-26255983.post-17426974581658566452009-03-26T21:23:00.011+01:002009-03-26T22:01:40.133+01:00The future of science and scientists - walking into open innovation ?<blockquote>"<span class="Apple-style-span" style="font-style: italic;">If you just build it they (probably) won’t come...<br />If you don’t build it they definitely won’t come...<br />The community is more important than the service...<br />Researchers are already the tail !</span>" [<a href="http://www.slideshare.net/CameronNeylon/walking-the-walk-the-practical-experience-of-web2-in-research-1201087">Cameron Neylon</a>]<br /></blockquote><blockquote>"<span class="Apple-style-span" style="font-style: italic;">An ideal collaboration market will enable just such an exchange of questions and ideas. It will bake in metrics of contribution so participants can demonstrate the impact their work is having. Contributions will be archived, timestamped, and signed, so it’s clear who said what, and when. Combined with high quality filtering and search tools, the result will be an open culture of trust which gives scientists a real incentive to outsource problems, and contribute in areas where they have a great comparative advantage. This will change science.</span>" [<a href="http://michaelnielsen.org/blog/?p=448">Michael Nielsen</a>]<br /></blockquote><blockquote>"<span class="Apple-style-span" style="font-style: italic;">Specialization will be one of many factors influencing creation of new types of scientists. And what are these types? Let me describe a few.</span>" [<a href="http://freelancingscience.com/2009/03/26/the-future-of-life-scientists/">Pawel Szczesny</a>]<br /></blockquote><div style="width:425px;text-align:left" id="__ss_1201087"><a style="font:14px Helvetica,Arial,Sans-serif;display:block;margin:12px 0 3px 0;text-decoration:underline;" href="http://www.slideshare.net/CameronNeylon/walking-the-walk-the-practical-experience-of-web2-in-research-1201087?type=presentation" title="Walking the walk - the practical experience of Web2 in research">Walking the walk - the practical experience of Web2 in research</a><object style="margin:0px" width="425" height="355"><param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=userscamerondocumentsaa-isisdocspresentationsscience2-0esiweb2workshop-090326050843-phpapp01&amp;stripped_title=walking-the-walk-the-practical-experience-of-web2-in-research-1201087"><param name="allowFullScreen" value="true"><param name="allowScriptAccess" value="always"><embed src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=userscamerondocumentsaa-isisdocspresentationsscience2-0esiweb2workshop-090326050843-phpapp01&amp;stripped_title=walking-the-walk-the-practical-experience-of-web2-in-research-1201087" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"></embed></object><div style="font-size:11px;font-family:tahoma,arial;height:26px;padding-top:2px;">View more <a style="text-decoration:underline;" href="http://www.slideshare.net/">presentations</a> from <a style="text-decoration:underline;" href="http://www.slideshare.net/CameronNeylon">CameronNeylon</a>.</div></div><div style="width:425px;text-align:left" id="__ss_1201945"><a style="font:14px Helvetica,Arial,Sans-serif;display:block;margin:12px 0 3px 0;text-decoration:underline;" href="http://www.slideshare.net/wilbanks/etech-wilbanks?type=powerpoint" title="Etech Wilbanks">Etech Wilbanks</a> via <a href="http://scienceblogs.com/commonknowledge/2009/03/lions_tigers_and_crowds.php">common knowledge</a>, Lions, Tigers, Crowds<object style="margin:0px" width="425" height="355"><param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=etechwilbanks-090326074611-phpapp01&amp;stripped_title=etech-wilbanks"><param name="allowFullScreen" value="true"><param name="allowScriptAccess" value="always"><embed src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=etechwilbanks-090326074611-phpapp01&amp;stripped_title=etech-wilbanks" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"></embed></object><div style="font-size:11px;font-family:tahoma,arial;height:26px;padding-top:2px;">View more <a style="text-decoration:underline;" href="http://www.slideshare.net/">presentations</a> from <a style="text-decoration:underline;" href="http://www.slideshare.net/wilbanks">John Wilbanks</a>.</div></div><div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/26255983-1742697458165856645?l=miningdrugs.blogspot.com'/></div>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.com4tag:blogger.com,1999:blog-26255983.post-80660572663839509522009-03-22T16:04:00.008+01:002009-03-22T19:50:45.352+01:00Bridging chemistry islands to the web data siloTriggered by a few announcements of major publishers and web-services I could not resist to create a little chemical-web 2.0-story.<br /><div style="width:425px;text-align:left" id="__ss_1180238"><a style="font:14px Helvetica,Arial,Sans-serif;display:block;margin:12px 0 3px 0;text-decoration:underline;" href="http://www.slideshare.net/joergkurtwegner/bridging-chemistry-islands?type=powerpoint" title="Bridging Chemistry Islands">Bridging Chemistry Islands</a><object style="margin:0px" width="425" height="355"><param name="movie" value="http://static.slideshare.net/swf/ssplayer2.swf?doc=bridgingchemistryislands-090322100123-phpapp01&rel=0&stripped_title=bridging-chemistry-islands" /><param name="allowFullScreen" value="true"/><param name="allowScriptAccess" value="always"/><embed src="http://static.slideshare.net/swf/ssplayer2.swf?doc=bridgingchemistryislands-090322100123-phpapp01&rel=0&stripped_title=bridging-chemistry-islands" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"></embed></object><div style="font-size:11px;font-family:tahoma,arial;height:26px;padding-top:2px;">View more <a style="text-decoration:underline;" href="http://www.slideshare.net/">presentations</a> from <a style="text-decoration:underline;" href="http://www.slideshare.net/joergkurtwegner">Joerg kurt Wegner</a>.</div></div><div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/26255983-8066057266383950952?l=miningdrugs.blogspot.com'/></div>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.com2tag:blogger.com,1999:blog-26255983.post-17767727019323707242009-03-18T21:27:00.011+01:002009-05-09T17:36:56.424+02:00Do drug development (D3) - Innovation while information ignorance is increasing?People thought always drug design is rational. Nowadays, things might have changed (maybe) ... and technology alone is not the solution. IT might support collaboration and information management, but we need seriously (a timely) action of legal advisors on supporting democratic-action and intellectual-property-protection.<br /><br /><a href="http://www.kk.org/thetechnium/archives/2008/10/the_expansion_o.php">Nova Spivack</a> has posted the interesting idea that the number of questions is faster growing than the number of answers. This forces us to ignore more facts, which leads us away from rational decision making, if we are not collaborating. I think, he has the point that in some areas we are facing <a href="http://miningdrugs.blogspot.com/2006/10/bounded-rational-drug-design.html">bounded rationality</a> in phases of a work process. On the other hand, I am willing to ignore information about the <a href="http://antwrp.gsfc.nasa.gov/apod/ap080403.html">south of orion</a> and rather read another drug design paper instead. The problem is that I can not read all 'relevant' articles (filter problem of <a href="http://miningdrugs.blogspot.com/2009/01/collaboration-bullseye-20-information.html">information overload</a>). Recently, I met <a href="http://www.linkedin.com/in/barryhardy">Barry Hardy</a> and he named it also the real-time problem of knowledge management. So, I have to filter, filter, and filter my priorities, which eliminates elements of making <a href="http://en.wikipedia.org/wiki/Serendipity">serendipitous</a> discoveries, and puts me on risk of being not innovative and <a href="http://en.wikipedia.org/wiki/Rational_ignorance">rational ignorant</a> in my work.<br /><br /><a href="http://miningdrugs.blogspot.com/2009/03/drug-design-paradigms-get-involved-in.html">Collaboration</a> and <a href="http://miningdrugs.blogspot.com/2007/11/six-rules-for-creating-data-driven-drug.html">data driven</a> decisions seem to be a good way for creating innovative ideas. Though, I think we will need a lot of <a href="http://dotank.nyls.edu/Aboutus.html">legal advice</a>, <a href="http://blog.wired.com/wiredscience/2008/08/clay-shirky-is.html">open source environmentalism</a> advice, and <a href="http://en.wikipedia.org/wiki/Data_security">data security</a> discussions before the drug design community can step into web 2.0 ! People do <a href="http://bhc3.wordpress.com/2009/03/17/how-to-tweet-your-way-out-of-a-job/">silly</a> things <a href="http://shankman.com/be-careful-what-you-post/">sometimes</a>, and even Google had already <a href="http://blogoscoped.com/archive/2009-03-08-n75.html">data leakage</a> problems. Usually, are internal people the <a href="http://www.networkworld.com/research/2008/011408-8-techs-data-leakage.html">major concern</a> for data leakage, not technology, or hackers. Data leakage is unacceptable for the drug design industry, where intellectual properties are critical for a drug design process of 10 to 15 years. Or we just need longer patent coverage. Some inventions take minutes, some decades, is it then fair that all patents have the same runtime?<br /><br />Anyway, <a href="http://blog.wired.com/wiredscience/2008/08/clay-shirky-is.html">Shirky</a> is right, we need to direct collaborative efforts. Just being open does not mean that questions are getting directed or answered (have seen this a lot). He mentions the <a href="http://dotank.nyls.edu/Aboutus.html">do tank</a> - the democratic action workshop - as solution? I certainly think that legal advisors should speed-up with closing more world2web and web2world gaps (terms from <a href="http://memebox.com/futureblogger/show/915-nova-spivack-s-web-as-world-observation-leads-us-further-down-the-rabbit-hole">Alvis Brigis</a>). Or is the information already faster growing than the legal procedures can protect it? To all the open data, open source, and open standard people out there ... I am still with you ... and as always, open must not mean free of charge, and I think we all need to get reimbursed or bound to licenses for avoiding that others are abusing our efforts.<br /><blockquote style="font-style: italic;">The <a href="http://dotank.nyls.edu/Aboutus.html">Do Tank</a> targets the "capability gap" in practicing collaboration and forming groups among people who realize the opportunity for more collaborative decision-making in their governments, communities, businesses, or other organizations but do not have the experience, skills, models or tools to fulfill the potential.<br />We bring "democratic" approaches to bear on our design work -- democratic understood, not as political ideology, but as a way of life where people work together to pursue shared goals.<br /></blockquote>Just from the standpoint of a scientist with a limited reading capacity; help me in talking to people which know more about it!<br /><div style="text-align: center;"><span class="Apple-style-span" style="color: rgb(0, 0, 238); text-decoration: underline;"><br /></span></div><div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/26255983-1776772701932370724?l=miningdrugs.blogspot.com'/></div>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.com0tag:blogger.com,1999:blog-26255983.post-40697060414796008272009-03-16T19:50:00.008+01:002009-03-17T19:04:03.081+01:00Reflect - annotate protein and small molecule names in web pages<a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_cUdwVNvWCbc/Sb6fsMM1JQI/AAAAAAAAAKk/X_GZXVKE0zk/s1600-h/startPage_reflective_logo.png"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 264px; height: 161px;" src="http://2.bp.blogspot.com/_cUdwVNvWCbc/Sb6fsMM1JQI/AAAAAAAAAKk/X_GZXVKE0zk/s320/startPage_reflective_logo.png" alt="" id="BLOGGER_PHOTO_ID_5313860191881864450" border="0" /></a><span style="font-style: italic;"></span><blockquote><span style="font-style: italic;">The core component of Reflect is a consolidated dictionary that links names and synonyms to source data entries. It was created by merging the </span><a style="font-style: italic;" href="http://string.embl.de/">STRING</a><span style="font-style: italic;"> and </span><a style="font-style: italic;" href="http://stitch.embl.de/">STITCH</a><span style="font-style: italic;"> databases, and it currently contains over 1.5 million proteins from 373 organisms and 4.3 million small molecules..</span></blockquote>The <a href="http://reflect.ws/">Reflect</a> site annotates any web page provided, or installs a <a href="http://en.wikipedia.org/wiki/Mozilla_Firefox">FireFox</a> plug-in to allow annotation of any web page after a right mouse-click 'reflect this page'. Here an example for the <a href="http://en.wikipedia.org/wiki/P53">tumor protein p53</a>.<br /><br /><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_cUdwVNvWCbc/Sb6hDEOSB6I/AAAAAAAAAK0/3wfhBhTgUNc/s1600-h/snapshot1.png"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 400px; height: 247px;" src="http://3.bp.blogspot.com/_cUdwVNvWCbc/Sb6hDEOSB6I/AAAAAAAAAK0/3wfhBhTgUNc/s400/snapshot1.png" alt="" id="BLOGGER_PHOTO_ID_5313861684389087138" border="0" /></a>The reflect team of <a href="http://schneider-www.embl.de/cms">Reinhard Schneider</a> is one of the <a href="http://www.elseviergrandchallenge.com/index.html">Elsevier grand challenge</a> finalists (sorry, for making the wrong statement before that they won this challenge, which is not known, yet).<div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/26255983-4069706041479600827?l=miningdrugs.blogspot.com'/></div>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.com3tag:blogger.com,1999:blog-26255983.post-23573329698438563952009-03-08T12:55:00.013+01:002009-03-08T19:54:29.219+01:00Drug design paradigms - get involved in helping patients - haystacks, islands, and cliffs<blockquote>"<span class="Apple-style-span" style="font-style: italic;">The availability of pharmaceuticals must not be taken for granted. Pharmaceutical research requires expertise, a large commitment, a great deal of patience and the courage to take large financial risks. What counts is a sustainable framework in Germany (and other countries, too) as a location for the pharmaceutical industry. This is not a guarantee for economic success but is certainly at its foundation.</span>" [<a href="http://www.vfa.de/download/SAVE/en/vfa-en/publikationen-en/e-wegetinvolved/e-wegetinvolved.pdf">we get involved</a> - vfa <a href="http://www.vfa.de/en/statistics/innovation/">innovation</a>]</blockquote><blockquote>"<span class="Apple-style-span" style="font-style: italic;">To bring personalized medicine forward, scientists, regulators, policy makers, and pharmaceutical company leadership will need to find ways to more successfully work together, adopt a new mindset, and take a networked approach to innovation—one in which we can successfully share pre-competitive information and common platforms such as biomarkers, genomic data, predictive toxicology, and serious adverse events information.</span>" [Paul Stoffels - <a href="http://www.cnbc.com/id/29548622">Modeling change</a>]</blockquote><span class="Apple-style-span" style="font-weight: bold;">Why is drug design an island-bridging-task?</span><br /><div>There are several drug design paradigms. Nowadays, it is more realistic to consider <a href="http://www.vfa.de/en/statistics/innovation/">innovative drug design</a> (<a href="http://www.vfa.de/download/SAVE/en/vfa-en/publikationen-en/e-statistics/e-statistics-2008.pdf">report for 2008</a>) as an <a href="http://manyeyes.alphaworks.ibm.com/manyeyes/visualizations/fda-application-approvals-2000-2008">island-bridging-task</a>, then some of the earlier drug design paradigms, like serendipity and searching in a haystack. </div><br /><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_cUdwVNvWCbc/SbOyaSXKDEI/AAAAAAAAAKM/te4dlvzTpG4/s1600-h/bridging_islands_small.jpg"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 400px; height: 250px;" src="http://1.bp.blogspot.com/_cUdwVNvWCbc/SbOyaSXKDEI/AAAAAAAAAKM/te4dlvzTpG4/s400/bridging_islands_small.jpg" alt="" id="BLOGGER_PHOTO_ID_5310784550275255362" border="0" /></a><br /><div><span class="Apple-style-span" style="font-weight: bold;">What are cliffs we need to bridge?</span><br /></div><div><div><ul><li>patent cliffs - For having enough money for research and all regulatory steps for bringing a drug to the patient we need <a href="http://en.wikipedia.org/wiki/Intellectual_property">intellectual properties</a> and <a href="http://en.wikipedia.org/wiki/Patent">patents</a> for ensuring at least some rights on innovative drugs. Associated with a patent is the patent expiration. This means the network of inventors looses all rights for exclusivity and any other company can just use the final outcome of research efforts without reimbursing the inventors. Inventors (a pharma company) looses a lot of money when a drug goes off-patent. This challenge is called a <a href="http://biz.yahoo.com/zacks/090306/18028.html?.v=1&amp;.pf=family-home">patent cliff</a> many pharmaceutical companies are facing in the next years. Anyway, we have to accept that generica companies are <a href="http://www.fiercepharma.com/story/indian-drugmakers-push-us-market/2009-03-06">expanding</a>, sometimes even by <a href="http://www.fiercepharma.com/story/fda-says-ranbaxy-faked-drug-data/2009-02-26">faking</a> drug data.</li><li>activity cliffs - Do you think chemists and biologists can just make any chemical structure they want? Do you think that small drug modifications are having a small impact on the behaviour of a drug? No, this <a href="http://dx.doi.org/10.1021/jm8014102">is</a> - <a href="http://dx.doi.org/10.1021/ci7004093">not</a> - <a href="http://www.goodreads.com/book/show/2176917.Drug_like_Properties_Concepts_Structure_Design_and_Methods_from_ADME_to_Toxicity_Optimization">true</a>, especially, because drugs are behaving differently in different species and because drug properties are not rarely inversely correlated.</li><li>science cliffs - Though people might assume that reviewed journal articles and patent procedures are free of errors, this is not the case. There are scientific<a href="http://pipeline.corante.com/archives/2009/02/27/your_paper_is_a_sack_of_raving_nonsense_thank_you.php"> journal publications with errors</a> or <a href="http://pipeline.corante.com/archives/2009/03/03/how_good_or_bad_are_patent_procedures_anyway.php">patents of questionable value</a>. Beside, usually are most journals biased towards positive results, only.</li><li>technological cliffs - In short, there is more <a href="http://www.google.com/patents?q=inhibitor&amp;scoring=1&amp;rview=1">chemical</a>, <a href="http://friendfeed.com/e/5d3ea2dd-6e79-908c-238c-f279f2816cbb/1000-human-genomes-How-do-you-manage-1000/">biological</a>, and <a href="http://miningdrugs.blogspot.com/2009/01/collaboration-bullseye-20-information.html">literature</a> information a single person can handle. Even worse, still, we are facing many non-connected data islands (data silos), often even without proper semantic or annotation.</li><li>communication cliffs - Some people, have a <a href="http://friendfeed.com/e/603db6a4-322d-4ba9-8443-b67268cfaa6b/Trying-to-figure-out-how-to-engage-the-chemistry/">problem</a> with collaboration, maybe just because of the <a href="http://en.wikipedia.org/wiki/Not_Invented_Here">NIH syndrome</a>, or insufficient collaboration models?</li></ul><div><div><span class="Apple-style-span" style="font-weight: bold;">What are bridges?</span></div></div><ul><li>patent cliffs - We need a lot of contributors - in <a href="http://www.cnbc.com/id/29548622">pharma</a> - <a href="http://dx.doi.org/10.1126/science.1165706">academia</a> - and other <a href="http://www.pharmastrategyblog.com/2009/03/nih-and-science-stimulus-or-porkulus.html">institutions</a>.</li><li>activity cliffs - We need support and research from many people to achieve and understand <a href="http://www.fda.gov/cder/drug/DrugSafety/DrugIndex.htm">drug safety</a>, especially for novel disease areas and targets.<br /></li><li>science bridges - Discuss science on all levels, not only in peer-reviewed journals, <a href="http://miningdrugs.blogspot.com/2009/02/nature-mission-science-join-blogosphere.html">join the science community</a> on all levels, also by accepting, discussing, and avoiding reinventing <a href="http://friendfeed.com/e/590c7cec-bd3a-4244-a952-6cc2a9a90941/The-All-Results-Journals-Because-all-your-results/">negative results</a>.</li><li>technological cliffs - We need technological, and especially social support for <a href="http://miningdrugs.blogspot.com/2009/01/collaboration-bullseye-20-information.html">enriching</a> and <a href="http://www.chemspider.com/blog/surechem-as-the-entity-extractor-on-chemmantis.html">curating</a> scientific information.<br /></li><li>comunication cliffs - It was already shown that each single user contribution can enrich <a href="http://www.parc.com/research/publications/details.php?id=5904">knowledge dramatically</a>. For the chemistry area are the reason for missing user contributions <a href="http://mndoci.com/blog/2009/02/23/more-on-chemistry-and-the-data-web/">not totally clear</a>, but optimized technology and collaboration models seem to be very relevant.</li></ul><div><span class="Apple-style-span" style="font-weight: bold;">so, get involved ... in <a href="http://miningdrugs.blogspot.com/2007/05/there-are-too-many-silos-and-often.html">funneling</a> innovation ! Help <a href="http://en.wikipedia.org/wiki/Linus%27s_Law">eyeballing</a> !</span></div><br /><blockquote>"<span class="Apple-style-span" style="font-style: italic;">Never doubt that a small group of thoughtful, committed citizens can change the world. Indeed, it's the only thing that ever has.</span>" [<a href="http://en.wikipedia.org/wiki/Margaret_Mead">M. Mead</a>]</blockquote><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_cUdwVNvWCbc/SbPpX6JuiKI/AAAAAAAAAKU/VgnLk-0hdzM/s1600-h/Cueva_de_las_Manos1.jpg"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 267px; height: 320px;" src="http://3.bp.blogspot.com/_cUdwVNvWCbc/SbPpX6JuiKI/AAAAAAAAAKU/VgnLk-0hdzM/s320/Cueva_de_las_Manos1.jpg" alt="" id="BLOGGER_PHOTO_ID_5310844982556264610" border="0" /></a><div><span class="Apple-style-span" style="font-weight: bold;"><br /></span></div><div>References</div><div><ul><li><span class="Apple-style-span" style="font-style: italic;"><span class="Apple-style-span" style="font-style: normal;">D. D. Richman, D. M. Margolis, M. Delaney, W. C. Greene, D. Hazuda, R. J. Pomerantz, </span>The Challenge of Finding a Cure for HIV Infection</span>, Science, <span class="Apple-style-span" style="font-weight: bold;">2009</span>, <span class="Apple-style-span" style="font-style: italic;">323</span>, 1304-1307. DOI: <a href="http://dx.doi.org/10.1126/science.1165706">10.1126/science.1165706</a></li><li>All pictures taken from Wikipedia commons (<a href="http://commons.wikimedia.org/wiki/Bridge">bridge</a>, <a href="http://en.wikipedia.org/wiki/Island">island</a>) and compiled with <a href="http://en.wikipedia.org/wiki/Picasa">Picasa</a>.</li><li>E. Kerns, Drug-like Properties: <a href="http://www.goodreads.com/book/show/2176917.Drug_like_Properties_Concepts_Structure_Design_and_Methods_from_ADME_to_Toxicity_Optimization">Concepts, Structure Design and Methods</a>: from ADME to Toxicity Optimization, <span class="Apple-style-span" style="font-weight: bold;">2008</span>.</li><li>Cueva de las Manos, <a href="http://en.wikipedia.org/wiki/Cueva_de_las_Manos">hands</a>.</li></ul></div></div></div><div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/26255983-2357332969843856395?l=miningdrugs.blogspot.com'/></div>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.com0tag:blogger.com,1999:blog-26255983.post-84806767192424745572009-02-28T17:33:00.011+01:002009-03-04T19:08:03.207+01:00Nature - mission science - join the blogosphere<span style="font-weight: bold;">Nature's mission statement</span><br /><blockquote>"<span style="font-style: italic;">FIRST, to place before the general public the grand results of Scientific Work and Scientific Discovery ; and to urge the claims of Science to a more general recognition in Education and in Daily Life.</span>" [<a href="http://www.nature.com/npg_/company_info/mission.html">Nature's mission statement</a>]</blockquote><blockquote>"<span style="font-style: italic;">SECONDLY, to aid Scientific men themselves, by giving early information of all advances made in any branch of Natural knowledge throughout the world, and by affording them an opportunity of discussing the various Scientific questions which arise from time to time.</span>" [<a href="http://www.nature.com/npg_/company_info/mission.html">Nature's mission statement</a>]</blockquote><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_cUdwVNvWCbc/SalngCRCdpI/AAAAAAAAAI4/RV5r9uU-cnw/s1600-h/company-info-big.jpg"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 154px; height: 133px;" src="http://2.bp.blogspot.com/_cUdwVNvWCbc/SalngCRCdpI/AAAAAAAAAI4/RV5r9uU-cnw/s320/company-info-big.jpg" alt="" id="BLOGGER_PHOTO_ID_5307887435894650514" border="0" /></a><span style="font-weight: bold;">Nature's recommendation</span><blockquote>"More researchers should engage with the blogosphere, including authors of papers in press. ... There are societal debates that have much to gain from the uncensored voices of researchers. A good blogging website consumes much of the spare time of the one or several fully committed scientists that write and moderate it. But it can make a difference to the quality and integrity of public discussion."<br /><a href="http://www.nature.com/nature/journal/v457/n7233/full/4571058a.html">It's good to blog (editorial)</a>, Nature, 457, 1058, 2009. DOI <a href="http://dx.doi.org/10.1038/4571058a">10.1038/4571058a</a></blockquote><span style="font-weight: bold;">See also</span><br /><ul><li><a href="http://miningdrugs.blogspot.com/2007/08/six-degrees-of-drug-design-social.html">Six degrees of drug design</a><br /></li><li>Citalopram: <a href="http://pipeline.corante.com/archives/2009/02/27/your_paper_is_a_sack_of_raving_nonsense_thank_you.php">Derek</a><br /></li><li>Hexacyclinol: <a href="http://pipeline.corante.com/archives/2009/02/19/hexacyclinol_a_forensic_case.php">Derek</a>, and many more <span style="text-decoration: underline;"></span><a href="http://totallysynthetic.com/blog/?p=248">1</a>,<a href="http://pipeline.corante.com/archives/2006/06/05/hexacyclinol_or_not.php">2</a>,<a href="http://pubs.acs.org/cen/news/84/i31/8431notw1.html">3</a>,<a href="http://cenonline.blogs.com/sanfrancisco_2006/2006/09/hexacyclinol_sh.html">4</a>,<a href="http://www.thechemblog.com/?p=108">5</a>,<a href="http://ashutoshchemist.blogspot.com/2006/07/duplicate-duplicateand-sad-case-of.html">6</a>.</li></ul><br /><span style="font-weight: bold;">Molecules</span><br /><ul><li>Citalopram, InChIKey=WSEQXVZVJXJVFP-UHFFFAOYSA-N <script type="text/javascript" src="http://www.chemspider.com/csjsapi.ashx?op=img&amp;tk=a1ef575b-ce61-449f-89b6-b3c88e2dc295&amp;id=2669&amp;w=150&amp;eid=Science+2.0+%28Nature+recommends+to+blog%29"></script></li><li>Hexacyclinol, InChIKey=PFZFRWWDGXFULQ-VIPVPVOESA-N <script type="text/javascript" src="http://www.chemspider.com/csjsapi.ashx?op=img&amp;tk=a1ef575b-ce61-449f-89b6-b3c88e2dc295&amp;id=16788289&amp;w=150&amp;eid=Science+2.0+%28Nature+recommends+to+blog%29"></script></li></ul><div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/26255983-8480676719242474557?l=miningdrugs.blogspot.com'/></div>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.com2tag:blogger.com,1999:blog-26255983.post-40973381464338866502009-02-19T20:58:00.007+01:002009-02-19T22:18:43.955+01:00Chemical suppliers - CHEMCATS and the long tail<blockquote>If CAS is providing the head of the chemical supplier power law function, who is providing the tail?</blockquote>As pointed out by Derek <a href="http://pipeline.corante.com/archives/2009/02/18/supplies_of_suppliers.php">recently</a> and <a href="http://pipeline.corante.com/archives/2008/05/05/naming_of_names.php">earlier</a> exists there a vast amount of chemical suppliers. But honestly, how many people are taking the challenge of browsing all suppliers when looking for a chemical building block ? I would guess that more than 80% of the chemists are just using SciFinder, which is based on CHEMCATS. The really funny part is that <a href="http://www.cas.org/expertise/cascontent/chemcats/datasub.html">chemical suppliers have to upload their data to CAS</a>. So, <a href="http://en.wikipedia.org/wiki/Chemical_Abstracts_Service">CAS</a> is not only charging users for the look-up of chemical substances, but they are also charging suppliers for uploading their substance data, and for keeping it up-to-date.<br /><br />I am wondering, could there not be other business models? Which other services could provide the <a href="http://en.wikipedia.org/wiki/The_Long_Tail">long tail</a> of chemical supplier information?<br /><br />Two potential upcoming services (in the future, not populated with information right now) could be <a href="http://depth-first.com/articles/2009/02/19/chemvendor-the-web-based-chemically-enabled-product-management-system">ChemVendor</a> and <a href="http://www.chemspider.com/">ChemSpider</a>. Both allow uploading data, and I guess it is more a rhetorical question, if an upload of chemical supplier information could be possible in the future? In contrast, both services are looking for more data upload, user, and supplier contributions.<br /><br />Here are the requirements for the CHEMCATS upload, so can ChemVendor and ChemSpider allow the same? Well, lets ask them ...<br /><ul><li>Catalog name/product list name</li><li>Chemical name</li><li>Synonyms, Trade names</li><li>Catalog Number/Order Number</li><li>CAS Registry Number - Why not <a href="http://www.iupac.org/inchi/release102final.html">InChIKeys</a> ?<br /></li><li>Structure diagram<br /></li><li>Grades</li><li>Purities</li><li>Quantities available</li><li>Prices</li></ul>ChemSpider allows even hosting more information, e.g. <a href="http://www.chemspider.com/blog/the-spectral-game-as-an-educational-tool.html">Spectra</a>, so lets see what happens in the next months.<br /><br />Anyway, which other services of chemical suppliers do exist right now?<br /><ul><li><a href="http://www.chemicalbook.com/">Chemical Book</a></li><li><a href="http://www.chembuyersguide.com/">ChemBuyersGuide</a></li><li><a href="http://www.rdchemicals.com/">RDChemicals</a></li><li><a href="http://www.emolecules.com/">eMolecules</a></li></ul>Of course, are there much more services out there, but honestly, even if I would list all of them, who would check all of them?<div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/26255983-4097338146433886650?l=miningdrugs.blogspot.com'/></div>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.com6tag:blogger.com,1999:blog-26255983.post-77910188311567165002009-02-10T19:25:00.008+01:002009-02-10T22:04:24.503+01:00Life science - Are the IT guys getting it ?<blockquote>"<span style="font-style: italic;">My argument is this: there is so much useful information that needs to be organized in a format that is free, navigational and easy to access. One person cannot do it alone; we all need to contribute for the betterment of science, in general. ... I’ll gladly contribute! How do we get the ball rolling?</span>" [<a href="http://therealmoforganicsynthesis.blogspot.com/2009/02/throwing-down-gauntlet-for-my-fellow.html">J</a>]<br /></blockquote><a href="http://therealmoforganicsynthesis.blogspot.com/2009/02/throwing-down-gauntlet-for-my-fellow.html">The Realm of Organic Synthesis</a> initiated a discussion about chemistry knowledge management, which has triggered a lot of follow-up discussions.<br /><br /><a href="http://therealmoforganicsynthesis.blogspot.com/2009/02/throwing-down-gauntlet-for-my-fellow.html?showComment=1234156560000#c3712166507188810968">Mitch</a> (<a href="http://www.chemistry-blog.com/">blog</a>) said that chemical informatics guys do not get it:<br /><blockquote>"<span style="font-style: italic;">What J and I (Mitch) are planning is truly awesome, but it is not going to fall within the realm of the type of collaborative work that the chemical informatics people know so well. J will announce the project in the next couple of days. I think you will find it completely awesome, or simply not get it.</span>"</blockquote> I am really curious to see what they are planning. I agree with <a href="http://depth-first.com/articles/2009/02/09/chemical-informatics-guys-dont-get-it">Rich</a> that many tools and databases are of questionable use for people working on a daily basis as chemists or with chemists or biologists.<br /><ul><li>Just by accident, an (industry) colleague was asking me today, if I can not recommend an open-source tool for browsing through a <a href="http://en.wikipedia.org/wiki/SD_format">SDFile</a>, which is the de-facto standard for exchanging chemical data with experimental biological activity. Honestly, my first thought was <a href="http://www.bioclipse.net/">BioClipse 2beta</a>, which I have not looked at for months. I know that people are working on it for years, but guess what, SDFile browsing was not working for the data files I tried. As quick-fix I recommended the freeware (not open source) <a href="http://www.hyleos.net/?s=applications&amp;p=ChemFileBrowser">ChemFileBrowser</a>. Not to forget all the commercial options I told him.<br />Anyway, looking forward for other recommendations I am not aware of ?</li><li>I agree again with Rich that CAS number searches are important. After the earlier discussions, we know that there is <a href="http://miningdrugs.blogspot.com/2008/03/cas-numbers-are-not-public-domain-are.html">little room for fair-use</a> as brought up by <a href="http://www.chemspider.com/blog/cas-discourages-using-scifinder-to-help-curate-wikipedia-structures-and-cas-numbers.html">Tony</a>. Still, <a href="http://en.wikipedia.org/wiki/Wikipedia_talk:WikiProject_Chemistry/CAS_validation#New_announcement_from_CAS">at the end</a> (of what?), the CAS guys are officially supporting Wikipedia, but is <a href="http://depth-first.com/articles/2008/05/26/simple-cas-number-lookup-and-more-with-chempedia">ChemPedia</a> legal with all the CAS numbers it hosts?<br /></li><li>PubChem: Well, lets put it that way, I know it, but how useful is it in industry? Lets face it, (dirty) quantity is not everything, but quality (<a href="http://www.chemspider.com/">ChemSpider</a>), and most important <a href="http://www.emolecules.com/doc/plus/">purchasebility</a> (<a href="http://www.emolecules.com/">eMolecules</a>). At the end of the day, we want to create compounds, not just look at them on the screen.<br /></li></ul>Anway, J and Mitch, on the one hand you say you want to collaborate on the other hand you are not interested in the collaborative work of the cheminformatics guys. Sorry, I am confused !<br />I really hope you do not want to create yet-another-<a href="http://miningdrugs.blogspot.com/2007/01/why-wasting-time-if-others-can-help-you.html">collaboration</a>-<a href="http://miningdrugs.blogspot.com/2006/09/even-if-online-what-is-missing-for.html">online</a>-<a href="http://miningdrugs.blogspot.com/2006/11/blogging-chemistry-means-not-blogging.html">data</a><a href="http://miningdrugs.blogspot.com/2006/11/blogging-chemistry-means-not-blogging.html">-</a><a href="http://miningdrugs.blogspot.com/2006/11/blogging-chemistry-means-not-blogging.html">silo</a>. I must agree with <a href="http://mndoci.com/blog/2009/02/08/dont-consolidate-distribute/">Deepak</a>, that just historically, it is unlikely that a single whatever service will make it. So, we have to think about some sort of communication between services. As I commented on Egons post, I believe in the <a href="http://chem-bla-ics.blogspot.com/2009/02/where-can-i-host-my-experimental-data.html?showComment=1234205940000#c5887293420831846370">social principles</a>, rather trust people than <a href="http://miningdrugs.blogspot.com/2009/01/collaboration-bullseye-20-information.html">technology alone</a>. Any form of knowledge is just data, and faces the risk of creating a silo: in people, in departments, and in institutions.<br /><br />Let me quote you again<br /><span style="font-style: italic;"></span><blockquote><span style="font-style: italic;">"much useful information that needs to be organized in a format that is free, navigational and easy to access"</span></blockquote>What a challenge. You can have<br /><ul><li>little structured information, easy to organize, easy to navigate<br /></li><li>much structured information, easy to organize, difficult to navigate</li><li>much unstructured information, difficult to organize, difficult to navigate, some people call this life science, and lets not even get started with the whole <a href="http://en.wikipedia.org/wiki/Omics">omics</a> thing ;-) Well, maybe <a href="http://www.youtube.com/user/SRIBRG">a little</a> !<br /></li></ul>Making it even more complicated, you mentioned collaboration, so<br /><ul><li>little contribution, easy group dynamics, no technical requirements, just take a piece of paper on a round-table</li><li>lot of contribution, difficult group dynamics, massive technical challenges ranging from communication standards to performance problems.</li></ul>At the end, I see some discrepancy between "lot of information" and "easy to do", so, again, curious to see what happens next. As long as people keep talking and the ball rolling, everything is possible ...<br /><br />... and some IT guys are really <a href="http://miningdrugs.blogspot.com/2007/11/andy-grove-when-silicon-indoctrinates.html"><span style="font-weight: bold;">not</span> getting</a> it !<div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/26255983-7791018831156716500?l=miningdrugs.blogspot.com'/></div>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.com7tag:blogger.com,1999:blog-26255983.post-32382120948747701872009-01-17T18:46:00.027+01:002009-02-20T23:20:15.745+01:00Collaboration bullseye 2.0: Information overload, filter failure, and ways out<blockquote>"<span style="font-style: italic;">Using the Internet today is like trying to use a library where all the books have been dumped on the floor and the lights turned out.</span>" [<a href="http://www.ibiblio.org/librariesfaq/sect8.htm#8.11">A.J. Vendeland, 1996</a>]</blockquote><blockquote>"<span style="font-style: italic;">A metaphor I use to describe the Internet and now web, how it is organized, and how useful the stuff is that one can find there. My kitchen junk drawer has lots of good useful stuff, mixed in with lots of useless stuff that I have no need for but I have kept anyway, just in case!</span>" [<a href="http://info.org.il/english/books_on_the_floor.html">J. C. Neale</a>]<br /></blockquote>This post has two major goals. One is to disentangle the difference between information overload and filter failure in science. The second is to show a potential way-out of both problems.<br /><br />This post was triggered by a controversial discussions, saying that in science information overload IS or IS NOT filter failure.<br /><ol><li><a href="http://www.youtube.com/watch?v=LabqeJEOQyI">IS</a> - Clay Shirky started the whole "It's not <a href="http://en.wikipedia.org/wiki/Information_overload">information overload</a>- It's filter failure" discussion and explains why. He brings even a scientific chemistry study group example after 15' in the video.</li><li><a href="http://friendfeed.com/e/cf26a58a-3205-b6ca-f426-26414a3d575a/Information-overload-is-NOT-filture/">IS NOT</a> - Bench Marks raises three major points against this idea in the context of science. First, filtering is already done, better than ever, but still the amount of information is too high. Second, he has concerns trusting a random group of web 2.0 strangers. Third, he thinks science will suffer, if the paper reading is left to other (not cutting edge) researchers, because they might not be good enough in connecting the innovative dots.<br /></li><li><a href="http://scienceoftheinvisible.blogspot.com/2009/01/information-overload-is-filter-failure.html">IS</a> - Alan Cann says that "Conventional wisdom holds that Aristotle was the last person to know everything (Neill &amp; Ridley 1995 The Philosophy of Art: Readings Ancient and Modern. McGraw Hill). That probably wasn't true - Aristotle was a victim of Rumsfeld's Syndrome - he didn't know what he didn't know. Certainly, every scientist since Aristotle has suffered from information overload - even if they didn't know it. So if we can't know everything, what do we have to do? Filter !"<br /></li><li><a href="http://friendfeed.com/e/339c8553-9c21-42ca-ba80-0b03209ea6d6/Bench-Marks-Blog-Archive-Information-Overload/">IS NOT</a> - Bench Marks clarifies again that scientists are already filtering like crazy, but due to the exponential growth in research and journals, there is more knowledge they can assimilate. He offers also two solutions. One is to specialize more, which seems contradictory to the cross-disciplinary world out there. Second, he recommends spending more time with reading, if scientists want to work in a cutting edge area.<br /></li></ol>Before I will start with discussing actual business trends and solutions, it is important to understand that there are <a href="http://blogs.law.harvard.edu/digitalnatives/2008/11/03/information-overwhelm-two-types-of-overload/">two major forms</a> of information overload. I think one of them relates more to the 'classic' information overload problem (rabbit hole risk), the other more to a filtering problem (Spigot risk). Still, both things are highly related and I am not sure if any differentiation is of much use. At the end we need to tackle both problems, e.g. by collaborating.<br /><br /><span style="font-size:130%;"><span style="font-weight: bold;">Rabbit hole risk of information overload</span></span><br /><blockquote>"<span style="font-style: italic;">Suddenly she sees a white rabbit run past wearing a coat and carrying a watch, lamenting running late. She follows it down a rabbit hole and falls very slowly down a tunnel lined with curious objects.</span>" [<a href="http://en.wikipedia.org/wiki/Alice%27s_Adventures_in_Wonderland">Alice in Wonderland</a>]<br /></blockquote>In short, this is the effect of highly non-linear information, e.g. encyclopedias, web pages with links, and scientific papers with references, which allow you do get easily distracted by more and more information, e.g. via links.<br /><br />I stopped counting the number of deja-vu's experiences, where I was facing this effect. I started with a single scientific article, and was ending up with an even larger number of papers on my reading list afterwards. Especially, review articles have usually more references I put later on my to-do list. So, this phrase 'follow the white rabbit', is not only something for <a href="http://en.wikipedia.org/wiki/Follow_the_white_rabbit">computer geeks</a>, but also a well-known information overload risk to knowledge and science junkies. Honestly, when thinking about cutting edge scientists, then my guess is, that they are more stressed than others, just because they are always trying to stay close to the scientific <a href="http://en.wikipedia.org/wiki/Oligarchy">oligarchy</a>. People which have stopped trying to catch up, do not care if they do *not* read five or twenty articles, news alerts, or encyclopedia entries.<br /><div style="text-align: left;"><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_cUdwVNvWCbc/SXIZt-Hxd9I/AAAAAAAAAIY/boX5nK1mK6s/s1600-h/rabbit_hole_risk.gif"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 200px; height: 300px;" src="http://1.bp.blogspot.com/_cUdwVNvWCbc/SXIZt-Hxd9I/AAAAAAAAAIY/boX5nK1mK6s/s320/rabbit_hole_risk.gif" alt="" id="BLOGGER_PHOTO_ID_5292320789674162130" border="0" /></a><br /><span style="font-size:130%;"><span style="font-weight: bold;">Spigot risk of information overload</span></span><br />The spigot risk is that the incoming scientific information flow is not only continuous, but also <a href="http://www.nature.com/nature/journal/v440/n7083/full/440413a.html">exponential</a>, while filtering and reading capacity is usually not.<br /><br />I would bet that anyone who tried to stay competitive has <a href="http://friendfeed.com/e/46630939-6544-43f1-a456-4712cad904f4/My-unread-email-and-RSS-count-Discuss/">faced this problem already</a>, especially people with the above mentioned rabbit-hole-syndrome. To get out of this dilemma, I rather trust people, than any technical solution (alone) to filter, rank, and annotate information, e.g. eMail alerts, or RSS streaming. This is also one reason why all the systems out there should allow a collaborating human factor to allow positive networking effects.<br /></div><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_cUdwVNvWCbc/SXIZqLdhMsI/AAAAAAAAAIQ/_hw1k9EJtZI/s1600-h/spigot_risk.gif"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 281px; height: 296px;" src="http://2.bp.blogspot.com/_cUdwVNvWCbc/SXIZqLdhMsI/AAAAAAAAAIQ/_hw1k9EJtZI/s320/spigot_risk.gif" alt="" id="BLOGGER_PHOTO_ID_5292320724535554754" border="0" /></a><span style="font-size:130%;"><span style="font-weight: bold;">Why social software might be a solution</span></span><br />The evidence for this can not only be found in the <a href="http://www.parc.com/research/publications/details.php?id=5904">wisdom of the crowds</a> on Wikipedia, but also in several top web enterprises, just look at the <a href="http://www.alexa.com/site/ds/top_sites?ts_mode=global&amp;lang=none">top 500 web companies</a> and how many of the top-scorers are based on social software elements.<br />As highlighted by <a href="http://www.youtube.com/watch?v=3NN3uej7IMs">Andrew McAfee</a> (<a href="http://andrewmcafee.org/blog/">blog</a>) is the key element to trust people at some point, which is definitely better than trusting reading or paper writing bots. He is showing how important it is to keep social networks on several levels (they are ranging from tight, weak, potential, to none). This is the major key for understanding the resulting collaboration forms based on this <a href="http://andrewmcafee.org/blog/?p=470">collaboration bullseye</a>.<br /><br />Again, why should we care?<br /><br />Should we not only rely on internal knowledge and closed groups? Actually, no ! We know already the problem of <a href="http://miningdrugs.blogspot.com/2006/10/bounded-rational-drug-design.html">bounded rational drug design</a>. We also know that "the problem of a rational economic order is… that the knowledge… of which we must make use never exists in concentrated or integrated form but solely as the dispersed bits of incomplete and frequently contradictory knowledge" [<a href="http://en.wikipedia.org/wiki/Friedrich_Hayek">F. Hayek</a>].<br />In other words, nobody expects that a single person knows everything.We know that people are getting more specialized and that we are facing a wide range of diverse cutting-edge experts. Guess what, I think this is exactly what we need, but <a href="http://andrewmcafee.org/blog/?p=462">not on all collaboration levels</a>. Andrew McAfee confirmed this also with the numbers of problems solved by <a href="http://en.wikipedia.org/wiki/InnoCentive">InnoCentive</a>. The numbers of problems solved was usually higher, if the diversity of the scientific contributors was higher. I must admit I was amazed to see that you can find a lot of <a href="http://gw.innocentive.com/ar/discipline?categoryName=Chemistry">chemical</a> and <a href="http://gw.innocentive.com/ar/discipline?categoryName=Life%20Sciences">life science</a> challenges on <a href="http://innocentive.com/">InnoCentive</a> (<a href="http://en.wikipedia.org/wiki/InnoCentive">@WP</a>).<br /><br />Finally, we have to trust other people, and they are not all random, some are already in a tight or at least weak collaboration and information network with you. Sorry, <a href="http://www.cshblogs.org/cshprotocols/2009/01/14/information-overload-is-not-filture-failure/">Bench Marks</a>, I would rephrase "To a hammer the world looks like nails" to<br /><blockquote>"I am a scientific nail, but I trust a hammer often. If not, I want to be close to other scientific nails. Lets make sure most nails are strengthening their networks and collaborations, not only their (closed) knowledge levels and data silos."<br /></blockquote>I would highly recommend to<a href="http://www.youtube.com/watch?v=3NN3uej7IMs"> follow McAfee's talk after (47:51)</a>, because he addresses also the intellectual property conflict question (or how open should we be?) and mentions highly respected life science companies like <a href="http://en.wikipedia.org/wiki/Pfizer">Pfizer</a>, <a href="http://en.wikipedia.org/wiki/Eli_Lilly_and_Company">Eli Lilly</a>, and <a href="http://en.wikipedia.org/wiki/Innocentive">InnoCentive</a>.<br /><br />Beside all the things already discussed, I also mention things I find useful for finding, filtering, collecting, and ranking information.<br /><ul><li>Searching - I am using <a href="http://en.wikipedia.org/wiki/IGoogle">iGoogle</a> as my personal homepage, and I have created a <a href="http://miningdrugs.blogspot.com/2007/01/ncbi-pubmed-and-doi-searches-on-google.html">few gadget</a> for <a href="http://miningdrugs.blogspot.com/2007/01/emolecules-and-leo-searches-on-google.html">helping</a> me, but I am also using things, which were already available, e.g. a <a href="http://www.google.com/ig/directory?url=bennokr.googlepages.com%2Fpdb-searcher.xml">RCSB</a> search.</li><li>Collecting - Since I am not the only one <a href="http://miningdrugs.blogspot.com/2007/01/why-wasting-time-if-others-can-help-you.html">loosing time</a> in finding relevant information, I use my very personal mixture of content management systems (<a href="http://en.wikipedia.org/wiki/Content_management_system">CMS</a>) like <a href="http://en.wikipedia.org/wiki/MindManager">MindManager</a>, <a href="http://en.wikipedia.org/wiki/XWiki">XWiki</a>, and a <a href="http://miningdrugs.blogspot.com/2007/05/literature-management.html">LaTeX/BibTeX</a> solution for keeping track of my read articles.<br /></li><li>Social networking - Well, the people which know me, are fully aware, that I like communities and that I appreciate community efforts. I think everybody, which tried to get community things going knows that it is not always easy to reach a <a href="http://miningdrugs.blogspot.com/2007/09/wikipedia-hits-two-million-articles-not.html">tipping point</a>. Some previous posts might highlight this:<ul><li> <a href="http://miningdrugs.blogspot.com/2008/06/science-editorial-social-or-both.html">editorial, social, or both</a> - again, I believe in <a href="http://en.wikipedia.org/wiki/Linus%27s_Law">Linus' law</a>: given enough eyeballs, all bugs are shallow<br /></li><li><a href="http://miningdrugs.blogspot.com/2007/12/community-moves-on-merry-xmas-and.html">a community moves on</a> - well, also networking costs time</li><li><a href="http://miningdrugs.blogspot.com/2007/08/six-degrees-of-drug-design-social.html">six degrees of drug design</a> - I believe in diverse within <a href="http://en.wikipedia.org/wiki/Six_degrees_of_separation">six steps</a> </li><li><a href="http://miningdrugs.blogspot.com/2006/09/quality-management-and-cheminformatics.html">quality improvements</a> are always possible</li><li>Finally, I am working for a pharmaceutical company, so I am open, but not stupid. Anybody in this industry knows that intellectual property protection, project work, external collaborators, and any target activities are highly confidential. So, if you are working on such things, like I do, do not expose this in any form on any typical open social network tool like your blog, <a href="http://en.wikipedia.org/wiki/Facebook">Facebook</a>, <a href="http://en.wikipedia.org/wiki/Twitter">Twitter</a>, <a href="http://en.wikipedia.org/wiki/Friendfeed">FriendFeed</a>, <a href="http://en.wikipedia.org/wiki/Plaxo">Plaxo</a>, whatever (see <a href="http://miningdrugs.blogspot.com/2007/08/six-degrees-of-drug-design-social.html">cons of social networking</a> or listen to Clay Shirky's <a href="http://www.youtube.com/watch?v=LabqeJEOQyI">friend story</a>). If you want to talk to people, follow the <a href="http://en.wikipedia.org/wiki/Non-disclosure_agreement">legal rules</a> and <a href="http://en.wikipedia.org/wiki/Intellectual_property">policies</a> !<br /></li></ul></li></ul><br /><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_cUdwVNvWCbc/SXL5i1dwKmI/AAAAAAAAAIg/Xsd9L04Q70A/s1600-h/Six_degrees_of_separation.gif"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 320px; height: 262px;" src="http://4.bp.blogspot.com/_cUdwVNvWCbc/SXL5i1dwKmI/AAAAAAAAAIg/Xsd9L04Q70A/s320/Six_degrees_of_separation.gif" alt="" id="BLOGGER_PHOTO_ID_5292566888976493154" border="0" /></a><br /><span style="font-weight: bold;font-size:130%;" >References/Credits</span><span style="font-size:130%;"><br /></span><ul><li>via <a href="http://www.ibiblio.org/">ibiblio</a> <a href="http://www.ibiblio.org/librariesfaq/sect8.htm#8.11">Libraries FAQ</a>: "<span style="font-style: italic;">Using the Internet today is like trying to use a library where all the books have been dumped on the floor and the lights turned out.</span>" A.J. Vendeland, in Review of The Software Publishers Association Europe Conference, Conference Analysis June 28, <span style="font-weight: bold;">1996</span>.</li><li>Image: Down the rabbit hole <a href="http://en.wikipedia.org/wiki/File:Down_the_Rabbit_Hole.png">@Wikipedia</a>, modification was created with <a href="http://en.wikipedia.org/wiki/GIMP">GIMP</a> and is available on request.<br /></li><li>Image: FlowingTap <a href="http://commons.wikimedia.org/wiki/File:Flowingtap.jpg">@Wikipedia</a>, modification was created with <a href="http://en.wikipedia.org/wiki/GIMP">GIMP</a> and is available on request.</li><li>Image: Six degrees of separation <a href="http://en.wikipedia.org/wiki/File:Six_degrees_of_separation.png">@Wikipedia</a>, modification was created with <a href="http://en.wikipedia.org/wiki/GIMP">GIMP</a> and is available on request.</li><li>The two forms of information overload on <a href="http://www.digitalnative.org/">DigitalNatives</a> were found via <a href="http://www.cyberculture.ro/blog/2009/01/05/warning-information-overload/">Strangers On The Net</a>.<br /></li></ul><div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/26255983-3238212094874770187?l=miningdrugs.blogspot.com'/></div>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.com0tag:blogger.com,1999:blog-26255983.post-27781389901584113722009-01-04T15:37:00.024+01:002009-01-05T19:38:20.293+01:00Antithrombotic graph visualization - Chemistry 2.0 and the ChemSpider journal<div style="text-align: center;"><img src="http://www.joergkurtwegner.de/blog/20090104/xa5.gif" alt="" border="0" height="135" width="120" /><img src="http://www.joergkurtwegner.de/blog/20090104/xa1.gif" alt="" border="0" height="120" width="127" /></div><blockquote>"<span style="font-style: italic;">It’s not Information Overload. It’s Filter Failure.</span> <span style="font-style: italic;">If you have the same problem for a long time, maybe it is not a problem, maybe it is a fact ... some of it is rethinking social norm ... we have continuously to improve filters, that just broke !</span>" [<a href="http://www.youtube.com/watch?v=LabqeJEOQyI">Clay Shirky</a>]<br /><br />"<span style="font-style: italic;">We intend to demonstrate how modern web technologies can be used to dramatically enhance the type of information that can be communicated using web-based tools over standard online publishing approaches.</span>" [<a href="http://www.chemspider.com/blog/announcing-the-chemspider-journal-of-chemistry.html">ChemSpider journal</a> (<a href="http://www.chemspider.com/blog/the-chemspider-journal-of-chemistry-editorial-board.html">editorial board</a>)]<br /></blockquote>Inspired by the story about the chemical-study-group example of <a href="http://www.youtube.com/watch?v=LabqeJEOQyI">Clay Shirky</a> (after minute 15), I got again reminded about the <a href="http://en.wikipedia.org/wiki/Information_overload">information overload</a> problem. I agree totally with Clay that a long persisting problem, encourages a desire to <a href="http://en.wikipedia.org/wiki/Change_management_%28people%29">get it changed</a>. As <a href="http://www.slideshare.net/AntonyWilliams/chemspider-presentation-at-the-acs-meeting-in-new-orleans-presentation">shown by Tony</a> from ChemSpider does the life science <a href="http://www.plaxo.com/groups/profile/103081406774">community</a> have the (technical) ability to enrich chemical data.<br /><br />So, are there just (social) barriers left? According to VonHippel do single users rarely show enough investment into innovations, if they think that this is only their single problem, but not the problem of a larger community (<a href="http://web.mit.edu/evhippel/www/democ1.htm">see chapter four</a>). But wait, I was always assuming that the community is having this problem <a href="http://miningdrugs.blogspot.com/2008/06/science-editorial-social-or-both.html">together</a>, not only some single people living with their data silos.<br /><br />Again, and in-line with Clay and VonHippel, are there good reasons to rethink social norms for improving information filter processes, e.g. the ones with <a href="http://miningdrugs.blogspot.com/2007/01/why-wasting-time-if-others-can-help-you.html">buried</a>, <a href="http://miningdrugs.blogspot.com/2007/07/case-review-2007.html">wrong</a>, <a href="http://en.wikipedia.org/wiki/Deep_Web">hidden</a>, or <a href="http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=28">shrouded</a> data.<br /><br />Now, lets have a look at an <a href="http://en.wikipedia.org/wiki/Antithrombotic">antithrombotic</a> drug design data example, based on <a href="http://en.wikipedia.org/wiki/Web_2.0">community web communication</a>.<br />I used the Fontaine et al. factor Xa data set (EC <a href="http://www.chem.qmul.ac.uk/iubmb/enzyme/EC3/4/21/6.html">3.4.21.6</a>) from <a href="http://www.cheminformatics.org/datasets/index.shtml">cheminformatics.org</a> and enriched it with <a href="http://www.ebi.ac.uk/thornton-srv/databases/cgi-bin/enzymes/GetPage.pl?ec_number=3.4.21.6">PDB</a> <a href="http://ligand-expo.rcsb.org/">ligand structures</a> (<a href="http://www.joergkurtwegner.de/blog/20090104/fXa.sdf.zip">final structure file</a>). Factor Xa inhibitors are inhibiting the <a href="http://scop.mrc-lmb.cam.ac.uk/scop/data/scop.b.c.hh.b.c.ei.html">trypsin-like</a> serine protease, involved in the <a href="http://en.wikipedia.org/wiki/Coagulation">blood coagulation</a> process.<br /><br />What has this data now to do with drug design? One very reasonable question could be<br /><blockquote>Which <a href="http://en.wikipedia.org/wiki/X-ray_crystallography">XRay</a> ligands are closest to the Fontaine et al. <a href="http://en.wikipedia.org/wiki/Structure-activity_relationship">structure-activity relationship</a> data for allowing <a href="http://en.wikipedia.org/wiki/Drug_design#Rational_drug_design">structure-based drug design</a>?</blockquote>In short, there might be many ways to give an answer. Here, we are now interested in checking the functionality of <a href="http://www.chemspider.com/">ChemSpider</a> and <a href="http://blueobelisk.sourceforge.net/wiki/Main_Page">blue obelisk</a> tools, e.g. <a href="http://openbabel.org/wiki/Python">Pybel</a>. The idea is also to move towards <a href="http://en.wikipedia.org/wiki/Web_2.0">web-based</a> and <a href="http://michaelnielsen.org/blog/?p=526">scientific</a> collaboration models, e.g. via <a href="http://rguha.wordpress.com/2008/12/10/chemistry-in-google-docs">google docs</a>.<br /><br />First, since I strongly believe in the benefits of curated data and community efforts, I was checking how much factor Xa data was already contributed and curated within ChemSpider. Honestly, I was facing a couple of challenges (or filter failures) by trying to answer this question. It is already easy to upload data to ChemSpider, but it is less efficient to find out, if those molecules are already registered there? Please mind, that especially people in industry have to protect <a href="http://en.wikipedia.org/wiki/Intellectual_property">intellectual properties</a> and can not always upload data immediately, but maybe contribute on a technical level or after <a href="http://www.freepatentsonline.com/help/item/When-does-a-patent-expire.html">patent expiration</a>. Based on the InChIKey (<a href="http://depth-first.com/articles/2007/10/01/streamlining-cheminformatics-on-the-web-let-inchi-do-the-heavy-lifting-and-get-some-rest">1</a>, <a href="http://www.chemspider.com/blog/?p=188">2</a>, <a href="http://www.chemspider.com/blog/we-need-an-inchikey-resolver-and-we-need-it-now.html">3</a>) it was possible to create an <a href="http://www.joergkurtwegner.de/blog/20090104/inchikey2cid.html">InChIKey2CID</a> script. Though, this web-based retrieval is still extremely slow and suboptimal, and certainly not usable for thousands of molecules. Nonetheless, none of the factor Xa molecules was uploaded to ChemSpider, not even the PDB ligands (see <a href="http://www.joergkurtwegner.de/blog/20090104/fXa.inchi.cid.tab.txt">flat file</a> result). So, still some work to do in the future ;-)<br /><br />Second, for now ChemSpider supports mainly a quick similarity check on InChIKeys, neglecting stereo information. Strictly speaking is this only one very specfific way of a similarity search. In other words, users are left with the option for a <a href="http://www.chemspider.com/PubChemSearch.aspx">PubChem search</a>, which covers also the ChemSpider data. Maybe there exist already a web-service for returning for a single molecule query the most similar or the most five similar compounds, but I have not found out how.<br />At the end, it was easier to follow Noels Pybel examples (<a href="http://openbabel.org/wiki/Python">general</a>, <a href="http://baoilleach.blogspot.com/2008/07/calculate-circular-fingerprints-with.html">fp</a>, <a href="http://baoilleach.blogspot.com/2007/07/pybel-hack-that-sd-file.html">sd</a>) and creating a <a href="http://www.joergkurtwegner.de/blog/20090104/molsim2xgml.html">molsim2gxml</a> script. The script will not only calculate fingerprint similarities between molecules, but also an <a href="http://www.yworks.com/products/yfiles/doc/developers-guide/xgml.html">XGML</a> output for the data visualization.<br /><div style="text-align: center;"><img src="http://www.joergkurtwegner.de/blog/20090104/xa5.gif" alt="" border="0" height="240" width="210" /><br /><a href="http://en.wikipedia.org/wiki/Factor_X#Factor_Xa">Xa</a> ligand similarity (top 5 compounds).<br />Original graph files (<a href="http://www.yworks.com/en/products_yed_about.html">yEd</a>): <a href="http://www.joergkurtwegner.de/blog/20090104/xa5.gml">gml</a>, <a href="http://www.joergkurtwegner.de/blog/20090104/xa5.graphml">graphml</a>, <a href="http://www.joergkurtwegner.de/blog/20090104/xa5.xgml">xgml</a>, <a href="http://www.joergkurtwegner.de/blog/20090104/xa5.svg">svg</a>.<br /><a href="http://www.tulip-software.org/">Tulip</a> acccepts also the <a href="http://en.wikipedia.org/wiki/Graph_Modelling_Language">GML</a> format.</div><br /><br /><div style="text-align: center;"><img src="http://www.joergkurtwegner.de/blog/20090104/xa1.gif" alt="" border="0" height="210" width="224" /><br /><br /><a href="http://en.wikipedia.org/wiki/Factor_X#Factor_Xa">Xa</a> ligand similarity (best compound, most similar).<br />Original graph files (<a href="http://www.yworks.com/en/products_yed_about.html">yEd</a>): <a href="http://www.joergkurtwegner.de/blog/20090104/xa1.gml">gml</a>, <a href="http://www.joergkurtwegner.de/blog/20090104/xa1.graphml">graphml</a>, <a href="http://www.joergkurtwegner.de/blog/20090104/xa1.xgml">xgml</a>, <a href="http://www.joergkurtwegner.de/blog/20090104/xa1.svg">svg</a>.<br /><a href="http://www.tulip-software.org/">Tulip</a> acccepts also the <a href="http://en.wikipedia.org/wiki/Graph_Modelling_Language">GML</a> format.</div><br />Finally, we obtain two graphs showing the similarity (connectivity) of the factor Xa SAR training data (blue), test data (green), and the XRay data (red). Each node represents one molecule, each edge links either the most similar compound (lower graph), or the five most similar compounds (upper graph).<br />In the lower graph, we see that only two XRay structures (indicated in red) are close to the factor Xa SAR data in the training set (indicated in blue). One is a zinc cation, which is in this case not interesting for ligand design, the other is the <a href="http://www.rcsb.org/pdb/search/smartSubquery.do?smartSearchSubtype=LigandIdQuery&amp;chem_comp.id=ZEN">ZEN-PDB ligand</a>. This ligand can be found in the PDB structures <a href="http://www.rcsb.org/pdb/cgi/explore.cgi?pdbId=1j17">1j17</a>, <a href="http://www.rcsb.org/pdb/cgi/explore.cgi?pdbId=1ql7">1ql7</a>, <a href="http://www.rcsb.org/pdb/cgi/explore.cgi?pdbId=1ql8">1ql8</a>, <a href="http://www.rcsb.org/pdb/cgi/explore.cgi?pdbId=1ql9">1ql9</a>, and <a href="http://www.rcsb.org/pdb/cgi/explore.cgi?pdbId=1v2k">1v2k</a>, which could be a good starting point for structure-based drug design (SBDD) and structure-property-relationship (SPR) transfer. On the other hand can we see that the overall distance between the Fontaine SAR data (blue and green) and the XRay data (red) can be easily spotted. So, be warned when trying any SPR transfer, and we have not even started talking about 3D ligands and the <a href="http://dx.doi.org/10.1016/j.jmb.2003.11.041">protein flexibility</a> !;-)<br /><br />So, yes, some of the public SAR data can be linked to Xray information and could support structure-based design. Anyway, a fast answer to the original question took some time. And could this now be uploaded to support the drug design community and avoiding this kind of filter failure problem for other community members? I am not aware that the <a href="http://www.ebi.ac.uk/">EBI</a>, or <a href="http://www.ncbi.nlm.nih.gov/">NCBI</a> services allow data upload.<br />On the other hand, would ChemSpider and the <a href="http://www.chemspider.com/blog/announcing-the-chemspider-journal-of-chemistry.html">ChemSpider journal</a> (<a href="http://www.chemspider.com/blog/the-chemspider-journal-of-chemistry-editorial-board.html">editorial board</a>) benefit from additional services, like fast InChIKey checks, or similarity searches.<br /><br /><span style="font-weight: bold;">References</span><br /><ul><li><a href="http://www.joergkurtwegner.de/blog/20090104/molsim2xgml.html">molsim2xgml</a> python script</li><li><a href="http://www.joergkurtwegner.de/blog/20090104/inchikey2cid.html">inchikey2cid</a> python script</li><li><a href="http://www.yworks.com/">yWorks</a> - the diagramming company</li><li>F. Fontaine, M. Pastor, I. Zamora, F. Sanz<br /><span style="font-style: italic;">Anchor-GRIND: Filling the Gap between Standard 3D QSAR and the GRid-INdependent Descriptors</span><br />J. Med. Chem., <span style="font-weight: bold;">2005</span>, <span style="font-style: italic;">48</span>, 2687-2694. DOI <a href="http://dx.doi.org/10.1021/jm049113+">10.1021/jm049113+</a></li><li>N. M. O'Boyle, C. Morley and G. R. Hutchison<br /><span style="font-style: italic;">Pybel: a Python wrapper for the OpenBabel cheminformatics toolkit</span><br />Chem. Cent. J. <span style="font-weight: bold;">2008</span>, <span style="font-style: italic;">2</span>, 5.<br />DOI <a href="http://dx.doi.org/10.1186/1752-153X-2-5">10.1186/1752-153X-2-5</a></li><li>R. Guha, M. T. Howard, G. R. Hutchison, P. Murray-Rust, H. Rzepa, C. Steinbeck, J. Wegner, E. L. Willighagen<br /><span style="font-style: italic;">The Blue ObeliskInteroperability in Chemical Informatics</span><br />J. Chem. Inf. Model., <span style="font-weight: bold;">2006</span>, <span style="font-style: italic;">46</span>, 991–998<br />DOI <a href="http://dx.doi.org/10.1021/ci050400b">10.1021/ci050400b</a></li><li>D. Rauh, G. Klebe, M. T. Stubbs<br /><span class="Apple-style-span" style="font-style: italic;">Understanding Protein–Ligand Interactions: The Price of Protein Flexibility</span><br />Journal of Molecular Biology, <span class="Apple-style-span" style="font-weight: bold;">2004</span>, <span class="Apple-style-span" style="font-style: italic;">335</span>, 1325-1341.</li>DOI <a href="http://dx.doi.org/10.1016/j.jmb.2003.11.041">10.1016/j.jmb.2003.11.041</a></ul><span style="font-weight: bold;">See also</span><br /><ul><li>Six degrees of drug design, <a href="http://miningdrugs.blogspot.com/2007/08/six-degrees-of-drug-design-social.html?showComment=1187114100000">blog post</a>.</li><li>Innovation funnel, <a href="http://miningdrugs.blogspot.com/2007/05/there-are-too-many-silos-and-often.html">blog post</a>.<br /></li></ul><a href="http://dx.doi.org/10.1021/jm049113+"></a><div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/26255983-2778138990158411372?l=miningdrugs.blogspot.com'/></div>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.com1tag:blogger.com,1999:blog-26255983.post-82947864784637865692008-12-20T15:54:00.005+01:002008-12-23T20:55:09.140+01:00Drug designers are creating innovative ideas<blockquote><span class="Apple-style-span" style="font-style: italic;">"The FDA will approve at least 21 first-of-a-kind drugs this year, up from 18 in 2007. And the number of black-box warnings decreased by 25 percent. The boxed warnings are the most serious a drug can carry.</span>" [<a href="http://www.salon.com/wires/ap/business/2008/12/19/D955VDSG1_fda_2008_drug_reviews/">salon.com</a>]</blockquote><div style="text-align: left;">It is well known that researchers in pharma are <a href="http://pipeline.corante.com/archives/2008/12/16/layoffs_more_layoffs.php">facing</a> challenging <a href="http://miningdrugs.blogspot.com/2007/11/commenting-on-grim-days-for-pharma.html">times</a>. Nonetheless, the FDA has <a href="http://www.salon.com/wires/ap/business/2008/12/19/D955VDSG1_fda_2008_drug_reviews/">increased the number</a> of approved drugs last year (21 drugs compared to 18 in 2007). This in combination with the <a href="http://www.vfa.de/download/SAVE/de/presse/publikationen/statistics2008.pdf">increasing innovation</a> in pharmaceutical research gives hope for the next years.<br /></div><div><div style="text-align: left;"><br /></div><div style="text-align: center;"><a href="http://manyeyes.alphaworks.ibm.com/manyeyes/visualizations/drug-innovations/comments/3c7116d2cea711ddaed4000255111976" style="margin: 0pt; padding: 0pt;"><img alt="3c3f8d9c-cea7-11dd-aed4-000255111976" src="http://manyeyes.alphaworks.ibm.com/manyeyes/files/thumbnails/3c3f8d9c-cea7-11dd-aed4-000255111976.png?size=400x300" style="border: 1px solid #AF755D; margin: 0; padding-top: 10px; padding-bottom: 15px;" /></a><br /></div><div style="text-align: left;"><br /></div><div style="text-align: left;">The question for me is, can this industry still be innovative with all those layoffs, and still make it a <a href="http://miningdrugs.blogspot.com/2007/11/commenting-on-grim-days-for-pharma.html">mission possible</a> for the next years? I know that some people think that technology can help<br /></div><blockquote><div style="text-align: left;">"<span class="Apple-style-span" style="font-style: italic;">The constant march of technology makes it possible for a smaller work force to do virtually the same job as a larger workforce.</span>" [<a href="http://www.pcworld.com/article/154450/google_layoffs_10000_jobs_being_cut_report_claims.html">10k Google layoffs</a>]</div><div style="text-align: left;"></div></blockquote><div style="text-align: left;">Honestly, this <a href="http://miningdrugs.blogspot.com/2007/11/andy-grove-when-silicon-indoctrinates.html">sounds reasonable</a> for the chip and software industry, but does this also work on scientific matters, especially for <a href="http://en.wikipedia.org/wiki/Drug_design">drug design</a>? If we keep thinking and are ready for change, this <a href="http://miningdrugs.blogspot.com/2006/10/disruptive-technology-in-drug-design.html">might be possible</a>.</div><div style="text-align: left;"><br /></div><div style="text-align: left;"><span class="Apple-style-span" style="font-weight: bold;">See also</span></div><div style="text-align: left;"><ul><li><a href="http://miningdrugs.blogspot.com/2008/01/boutique-drugs-splitting-mass-market.html">Boutique drugs</a><br /></li><li><a href="http://miningdrugs.blogspot.com/2007/05/there-are-too-many-silos-and-often.html">Innovation funnel</a></li></ul></div><a href="http://manyeyes.alphaworks.ibm.com/manyeyes/visualizations/drug-innovations/comments/3c7116d2cea711ddaed4000255111976" style="margin: 0pt; padding: 0pt;"></a></div><div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/26255983-8294786478463786569?l=miningdrugs.blogspot.com'/></div>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.com0tag:blogger.com,1999:blog-26255983.post-66520579427308642352008-12-17T20:33:00.003+01:002008-12-17T20:54:00.108+01:00Group Contribution revisited (via Fragment Based Drug Design)<blockquote>"<span style="font-style: italic;">What chemists really need is reliable data, and biophysical techniques</span>" [<a href="http://practicalfragments.blogspot.com/2008/11/more-fragment-events-for-2009.html">FBDD events 2009</a>]</blockquote>(via <a href="http://practicalfragments.blogspot.com/2008/12/50-aint-half-bad.html">practical fragments</a>)<br /><blockquote style="font-style: italic;">In the world of fragment-based ligand discovery, researchers hope that two fragments, when linked together, will behave at least additively: the free energies of binding for each fragment will sum together, with a multiplicative effect on affinity. In ideal cases, linked fragments will behave synergistically. But all too often, linking two fragments produces disruptive behavior, and the resulting molecule actually binds less tightly than would be predicted based on the binding energies of the individual fragments. This occurs not just when linking fragments, but in fragment merging and growing as well. Can such phenomena be modeled?<br />The <a href="http://pubs.acs.org/doi/abs/10.1021/jm00334a001">mathematical groundwork was described</a> more than forty years ago by Spencer Free and James Wilson at the old Smith Kline and French company, and came to be known as a Free-Wilson analysis. In a <a href="http://pubs.acs.org/doi/abs/10.1021/jm801070q">nice update of this work</a>, Julen Oyarzabal and co-workers have applied this technique to ... a variety of different targets, including a kinase, GPCRs, ion channels, and P450s.<br />...<br />But even if synergy is elusive, the paper suggests that you’ve got a better than even shot of producing a whole that is at least equal to the sum of its parts.<br /></blockquote>For an excellent pragmatic review about FBDD see also the comments in <a href="http://practicalfragments.blogspot.com/2008/11/more-fragment-events-for-2009.html">FBDD events 2009</a>.<div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/26255983-6652057942730864235?l=miningdrugs.blogspot.com'/></div>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.com0tag:blogger.com,1999:blog-26255983.post-69544095154647174222008-12-15T20:06:00.007+01:002008-12-15T21:07:47.249+01:00Mining clinical trials<blockquote>"<span style="font-style: italic;">This (clinical data mining) application enables rapid extraction of information about institutions, diseases, clinical approaches, clinical trials dates, predominant cancer types in the trials, clinical opportunities and pharmaceutical market coverage.</span>" [<a href="http://dx.doi.org/10.1186/1745-7580-4-7">10.1186/1745-7580-4-7</a>]</blockquote><br />(<a href="http://www.earlham.edu/%7Epeters/fos/2008/12/data-mining-from-clinicaltrialsgov.html">via open access news</a>)<br /><p>In 2007 more than seven million people died from <a href="http://en.wikipedia.org/wiki/Cancer">cancer</a>. This means there is still a lot to do for helping patients and for reducing this number. This is also the reason why each new treatment option has to follow good design <span class="blsp-spelling-corrected" id="SPELLING_ERROR_0">criteria</span> and sufficient testing, called <a href="http://en.wikipedia.org/wiki/Clinical_trial">clinical trial design</a>. Typical design <span class="blsp-spelling-corrected" id="SPELLING_ERROR_1">criteria</span> could be:<br /></p> <ul><li>assess the safety and effectiveness of a new medication or device on a specific kind of patient (e.g., patients who have been diagnosed with <a href="http://en.wikipedia.org/wiki/Alzheimer%27s_disease" title="Alzheimer's disease">Alzheimer's disease</a>)</li><li>assess the safety and effectiveness of a different dose of a medication than is commonly used (e.g., 10 mg dose instead of 5 mg dose)</li><li>assess the safety and effectiveness of an already marketed medication or device for a new indication, i.e. a disease for which the drug is not specifically approved</li><li>assess whether the new medication or device is more effective for the <span class="blsp-spelling-corrected" id="SPELLING_ERROR_2">patients</span> condition than the already used, standard medication or device ("the gold standard" or "standard therapy")</li><li>compare the effectiveness in patients with a specific disease of two or more already approved or common interventions for that disease (e.g., Device A vs. Device B, Therapy A vs. Therapy B)</li></ul>Today, at the 2008-12-15, are there <a href="http://clinicaltrials.gov/search/term=%22cancer%22">20292 clinical trials with a cancer treatment</a> published, some are recruiting, some are ongoing, and some are completed. This information can be accessed in <a href="http://clinicaltrials.gov/search/term=%22cancer%22?displayxml=true&amp;count=200&amp;start=1">XML format</a> (<a href="http://clinicaltrials.gov/ct2/html/images/info/public.dtd">DTD</a>) (please read <a href="http://clinicaltrials.gov/ct2/info/terms">terms and conditions</a>) for allowing an easier <a href="http://en.wikipedia.org/wiki/Data_mining">data mining</a> on those large data sets. One method is for example the ability to <a href="http://dx.doi.org/10.1186/1745-7580-4-7">browse through cancer vaccine trials</a> via <a href="http://en.wikipedia.org/wiki/Spotfire"><span class="blsp-spelling-error" id="SPELLING_ERROR_3">SpotFire</span></a>, a <a href="http://en.wikipedia.org/wiki/Business_intelligence">business intelligence</a> tool.<br /><br />Other options could be to cross-link this information, e.g. by using <a href="http://friendfeed.com/e/47d8a241-37fa-78f7-a2f3-506983573e64/Adding-Publications-to-ChemSpider-via-Digital/">crowd-sourcing</a> concepts (<a href="http://en.wikipedia.org/wiki/Crowdsourcing"><span class="blsp-spelling-error" id="SPELLING_ERROR_4">Crowdsourcing</span></a>@<span class="blsp-spelling-error" id="SPELLING_ERROR_5">Wikipedia</span>).<br /><br /><span style="font-weight: bold;">References</span><br /><a href="http://dx.doi.org/10.1186/1745-7580-4-7"></a><ul><li>X. <span class="blsp-spelling-error" id="SPELLING_ERROR_6">Cao</span>, K. B. <span class="blsp-spelling-error" id="SPELLING_ERROR_7">Maloney</span>, V. <span class="blsp-spelling-error" id="SPELLING_ERROR_8">Brusic</span>, Data mining of cancer vaccine trials: a bird's-eye view,<br /><span class="blsp-spelling-error" id="SPELLING_ERROR_9">Immunome</span> Research 2008, 4. <span class="blsp-spelling-error" id="SPELLING_ERROR_10">doi</span>:<a href="http://dx.doi.org/10.1186/1745-7580-4-7">10.1186/1745-7580-4-7</a></li><li><a href="http://clinicaltrials.gov/ct2/info/linking">Linking and downloading of <span class="blsp-spelling-error" id="SPELLING_ERROR_11">ClinicalTrials</span>.gov data</a>. Please read <a href="http://clinicaltrials.gov/ct2/info/terms">Terms and Conditions</a>. </li></ul><div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/26255983-6954409515464717422?l=miningdrugs.blogspot.com'/></div>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.com0tag:blogger.com,1999:blog-26255983.post-77717941055674669272008-12-06T17:43:00.007+01:002008-12-06T18:20:28.795+01:00Public data sets on AWS (Amazon Web Services)<blockquote><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://media.amazonwebservices.com/logo_aws.gif"><img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 164px; height: 60px;" src="http://media.amazonwebservices.com/logo_aws.gif" border="0" alt="" /></a><span class="Apple-style-span" style="font-style: italic;">"AWS hopes to provide researchers across a variety of disciplines and industries with tools to enable more innovation, more quickly"</span> [<a href="http://aws.amazon.com/publicdatasets/">Public data on AWS</a>]</blockquote><span class="Apple-style-span" style="font-weight: bold;"><div><span class="Apple-style-span" style="font-weight: normal;">If you are interested in <a href="http://en.wikipedia.org/wiki/Cloud_computing">cloud computing and storage</a> then you will be delighted that Amazon offers support to the science community. Though appreciated, do I think that the information provided for the data sets is very sparse, e.g. which conformer generation was used for the chemistry data sets? How exactly was the UGI dataset created?</span></div><div><span class="Apple-style-span" style="font-weight: normal;"><br /></span></div><div><span class="Apple-style-span" style="font-weight: normal;">Anyway, this sounds similar to the computing and storage solution of Google, called <a href="http://en.wikipedia.org/wiki/MapReduce">MapReduce</a> and <a href="http://en.wikipedia.org/wiki/Google_File_System">GFS</a>. I do not know enough about it to understand the technical differences between the frameworks of Amazon and Google, but both are used to work with tons of data. As announced by Amazon would users (or companies) only have to pay for their user storage and computing time, but not for the public data. This is fair, but I was also wondering, if people could buy technical support as well?</span></div><div><br /></div>BIOLOGY</span><br /><ul><li>Annotated Human Genome Data provided by <a href="http://www.ensembl.org/">ENSEMBL</a> - An annotated form of the Human Genome, perfect for biological research</li><li><a href="http://www.ncbi.nlm.nih.gov/Genbank/">GenBank</a> provided by the National Center for Biotechnology Information – COMING SOON - An annotated collection of all publicly available DNA sequences including more than 85.7B bases and 82.8M sequence records</li><li><a href="http://www.ncbi.nlm.nih.gov/unigene">UniGene</a> provided by the National Center for Biotechnology Information – COMING SOON - A set of transcript sequences of well-characterized genes and hundreds of thousands of expressed sequence tags (EST)</li></ul><span class="Apple-style-span" style="font-weight: bold;">CHEMISTRY</span><br /><ul><li>A 3D Version of the <a href="http://pubchem.ncbi.nlm.nih.gov/">PubChem</a> Library provided by Rajarshi (<a href="http://rguha.wordpress.com/">blog</a>, <a href="http://jijo.cjb.net/">homepage</a>) - A 3D (single conformer) version of Pubchem, a public database of chemical structures in SD Format</li><li>UGI Virtual Conformer Library provided by Rajarshi (<a href="http://rguha.wordpress.com/">blog</a>, <a href="http://jijo.cjb.net/">homepage</a>) - 80GB of data in SD format on conformers for 500,000 molecules that can be used for virtual screening</li><li><a href="http://pubchem.ncbi.nlm.nih.gov/">PubChem</a> Library provided by by the National Center for Biotechnology Information – COMING SOON - A data set of information on the biological activities of small molecules</li></ul><div>(via <a href="http://delicious.com/url/52f1c856d2c5679e12234c6132525bae">Egon@del.icio.us</a>)</div><div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/26255983-7771794105567466927?l=miningdrugs.blogspot.com'/></div>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.com0tag:blogger.com,1999:blog-26255983.post-51660335153351203232008-11-09T16:23:00.008+01:002008-11-09T16:55:39.257+01:00Python Training for Cheminformatics and Modeling<blockquote>"<span style="font-style:italic;">the best choice of high-level programming language for computational chemistry is <a href="http://en.wikipedia.org/wiki/Python_(programming_language)">Python</a></span>" [A. Dalke]</blockquote><div>Many cheminformatic/modeling toolkits provide <a href="http://en.wikipedia.org/wiki/Python_(programming_language)">Python</a> interfaces, as shown in the summary below. This is a good reason to consult Andrew for a <a href="http://dalkescientific.com/training/">training sessions</a>, or checking <a href="http://www.dalkescientific.com/writings/diary/">Andrew's</a> or <a href="http://baoilleach.blogspot.com/">Noel's</a> blogs from time to time. The various open source projects, from which a few are stalled, support Python, as well as the academic/commercial <a href="http://www.eyesopen.com/products/toolkits/oechem.html">OEChem</a> toolkit. Two very important modeling programs supporting Python are <a href="http://www.schrodinger.com/scripts.html">Schrödinger's</a> modeling suite and <a href="http://pymol.sourceforge.net/">PyMol</a>. For a training on the last two programs, you best contact the software support teams directly.<br /></div><div><br /></div><div>Please let me know, if other molecular modeling/ cheminformatics/ statistical modeling programs should be mentioned here !</div><br /><img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 385px; height: 210px;" src="http://dalkescientific.com/writings/toolkit-timeline.png" border="0" alt="" /><div style="text-align: center;">[source: EuroQSAR poster, A. Dalke, 2008, <a href="http://www.dalkescientific.com/writings/diary/archive/2008/09/20/euroqsar.html">URL</a>, <a href="http://dalkescientific.com/writings/Python-EuroQSAR2008-large.pdf">PDF</a>]<br /></div><div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/26255983-5166033515335120323?l=miningdrugs.blogspot.com'/></div>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.com4tag:blogger.com,1999:blog-26255983.post-42164147374929186292008-11-08T13:49:00.007+01:002008-11-09T17:04:27.080+01:00Support the star of life 2.0<blockquote><span class="Apple-style-span" style="font-style: italic;">Imagine a world in which every single person on the planet is given free access to the sum of all human knowledge.</span><br />— <a href="http://en.wikipedia.org/wiki/Jimmy_Wales">Jimmy Wales, Founder of Wikipedia</a></blockquote><div style="text-align: center;"><blockquote><div style="text-align: left;"><span class="Apple-style-span" style="font-style: italic;">Imagine the sum of all natures knowledge can maintain and restore </span><a href="http://en.wikipedia.org/wiki/Health" title="" style="color: rgb(0, 43, 184); background-image: none; background-repeat: initial; background-attachment: initial; -webkit-background-clip: initial; -webkit-background-origin: initial; background-color: initial; text-decoration: underline; background-position: initial initial; "><span class="Apple-style-span" style="font-style: italic;">health</span></a><span class="Apple-style-span" style="font-style: italic;"> by the </span><a href="http://en.wikipedia.org/wiki/Prevention_(medical)" title="Prevention (medical)" style="color: rgb(0, 43, 184); background-image: none; background-repeat: initial; background-attachment: initial; -webkit-background-clip: initial; -webkit-background-origin: initial; background-color: initial; text-decoration: underline; background-position: initial initial; "><span class="Apple-style-span" style="font-style: italic;">prevention</span></a><span class="Apple-style-span" style="font-style: italic;"> and </span><a href="http://en.wikipedia.org/wiki/Treatment" title="Treatment" style="color: rgb(0, 43, 184); background-image: none; background-repeat: initial; background-attachment: initial; -webkit-background-clip: initial; -webkit-background-origin: initial; background-color: initial; text-decoration: underline; background-position: initial initial; "><span class="Apple-style-span" style="font-style: italic;">treatment</span></a><span class="Apple-style-span" style="font-style: italic;"> of </span><a href="http://en.wikipedia.org/wiki/Illness" title="" style="color: rgb(0, 43, 184); background-image: none; background-repeat: initial; background-attachment: initial; -webkit-background-clip: initial; -webkit-background-origin: initial; background-color: initial; text-decoration: underline; background-position: initial initial; "><span class="Apple-style-span" style="font-style: italic;">illness</span></a><span class="Apple-style-span" style="font-style: italic;">, as symbolized by the </span><a href="http://en.wikipedia.org/wiki/Star_of_Life"><span class="Apple-style-span" style="font-style: italic;">star of life</span></a><span class="Apple-style-span" style="font-style: italic;">. </span><br /></div><div style="text-align: left;">— free interpretation of <a href="http://en.wikipedia.org/wiki/Medicine">medicine</a> and <a href="http://en.wikipedia.org/wiki/Web_2.0">web 2.0</a></div></blockquote> </div><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://wikimediafoundation.org/wiki/Donate/en"><img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 198px; height: 192px;" src="http://upload.wikimedia.org/wikipedia/commons/a/ae/Star_of_life.svg" border="0" alt="star of life" /></a><div style="text-align: center;"><a href="http://wikimediafoundation.org/wiki/Donate/en"><img border="0" alt="Wikipedia Affiliate Button" width="410" height="54" src="http://upload.wikimedia.org/wikipedia/foundation/1/1a/2008_fundraiser_banner_button-en.png" /></a><br /></div><div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/26255983-4216414737492918629?l=miningdrugs.blogspot.com'/></div>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.com0tag:blogger.com,1999:blog-26255983.post-25078884223033291532008-08-03T20:01:00.006+02:002008-08-03T22:25:32.058+02:00Data, models, or both?<blockquote><span style="font-style: italic;">Correlation supersedes causation, and science can advance even without coherent models, unified theories, or really any mechanistic explanation at all.</span> [<a href="http://www.wired.com/science/discoveries/magazine/16-07/pb_theory">C. Anderson, Wired, 2008-07-16</a>]</blockquote>First, I disagree with this statement. Second, thanks to <a href="http://delicious.com/bioinfoman3">bioinfoman3@delicious</a> for sharing this information.<br /><br />Honestly, I do not get it, why are people claiming that the scientific community, e.g. <a href="http://en.wikipedia.org/wiki/Drug_design">drug design</a>, is similar to the <a href="http://miningdrugs.blogspot.com/2007/11/andy-grove-when-silicon-indoctrinates.html">chip industry</a> or <a href="http://www.wired.com/science/discoveries/magazine/16-07/pb_theory">google concepts</a>. Here, Chris Anderson claimed that data alone will replace theoretic concepts. As said by others, in the comments to his article, is data alone not information. <a href="http://amundblog.blogspot.com/2008/07/rebirth-of-confounding-and-theory-in.html">Amund Tveit</a> confirms this by showing that data correlations alone might be misleading, because you can find correlations in everything, even if it makes no sense. Data means any data, so if you look-up <a href="http://www.google.be/search?q=nme">NME</a> in google then you will not find a <a href="http://en.wikipedia.org/wiki/New_chemical_entity">new molecular entity</a> as first hit, though this it what I was expecting. So, I have serious doubts that this kind of thinking is supporting science at all, since it might lead away from the right path.<br /><br />In the drug design area or other domains with chemical and biological information it is well known that the <a href="http://depth-first.com/articles/2007/03/05/why-the-web-isnt-ready-for-chemistry">raw data web is not ready</a> or, at best, <a href="http://miningdrugs.blogspot.com/2007/06/chemical-online-diaries-still.html">suboptimal</a> for chemistry. Let us assume that we have already fully equipped and finished data <a href="http://miningdrugs.blogspot.com/2007/10/supertarget-matador-chemspider-when.html">curation</a>, and <a href="http://miningdrugs.blogspot.com/2008/07/wikipathways-challenge-of-community.html">ontology</a> projects, which provide information, not only data. Does this information-rich data help making new drugs without any models? I doubt so! Beside, the curation and ontology projects are far from being finished, so I doubt so heavily ! We have to accept that any data set will be always incomplete or noisy, even if clean music, book, gaming, and video data would be added. Would that support drug design or other scientific areas?<br />As said earlier<br /><blockquote><span style="font-style: italic;">Just accept that we are working in an area where the chemical space is just too large for allowing us to get the complete picture of it. Thus, embrace incompleteness (bounded rational drug design).</span> [<a href="http://miningdrugs.blogspot.com/2007/11/six-rules-for-creating-data-driven-drug.html">Six Rules For Creating a Data Driven Drug Design Project</a>, Mining Drug Space, 2007-11-02]</blockquote>I completely agree, that<br /><blockquote><span style="font-style: italic;">All models are wrong, but some are useful.</span>. [<a href="http://en.wikiquote.org/wiki/George_E._P._Box">G. E. P. Box</a>]</blockquote>By the way, if Chris Anderson is stating this at the <a href="http://www.wired.com/science/discoveries/magazine/16-07/pb_theory">very beginning of his article</a>, why is he then trying to neglect the usefulness of models afterwards?<br /><br />If people keep claiming that they know how to change science positively, then I am wondering, if they know that data generation itself might be a serious bottleneck. In drug design some effects, e.g. <a href="http://en.wikipedia.org/wiki/ADME">ADME</a> and <a href="http://en.wikipedia.org/wiki/Toxicity">toxicity</a>, are really hard and expensive to measure. Believe me, if the model disbelievers tell us that we have already enough data and models for understanding oral drugs in humans, then anyone in the drug design industry would be happy to not only reduce, but stop, human <a href="http://en.wikipedia.org/wiki/Clinical_trial">clinical tests</a> and provide life saving drugs to patients immediately.<br /><br />Finally, I strongly believe, that we need as much connected and information-rich data as possible (so far I align with C. Anderson), that we need good models for supporting decision making processes and saving time and environmental resources, and that we need a combination of both for innovative scientific thinking and novel developments.<div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/26255983-2507888422303329153?l=miningdrugs.blogspot.com'/></div>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.com6tag:blogger.com,1999:blog-26255983.post-47866407680773697362008-07-27T21:27:00.000+02:002008-07-27T21:44:40.438+02:00Quotes from the neighbors (July 2008)==<a href="http://pipeline.corante.com/archives/2008/07/16/receptors_cant_live_with_em_cant_understand_em.php">Receptors: Can't Live With 'Em, Can't Understand 'Em (In the pipeline)</a>==<br /><blockquote><span style="font-style: italic;">At various points in my drug discovery career, I’ve worked on G-protein-coupled receptor (<a href="http://en.wikipedia.org/wiki/G_protein-coupled_receptor">GPCR</a>) targets. Most everyone in the drug industry has at some point – a significant fraction of the known drugs work through them, even though we have a <a href="http://omicsomics.blogspot.com/2007/10/at-long-last-2nd-gpcr-crystal-structure.html">heck of a time</a> knowing what their structures are like.<br />For those outside the field, GPCRs are a ubiquitous mode of signaling between the interior of a cell and what’s going on outside it, which accounts for the hundreds of different types of the things. ...</span></blockquote>==<a href="http://ashutoshchemist.blogspot.com/2008/06/dimers-long-sought-minimal-culprit-in.html">A-β Dimers- The Long-Sought Minimal Culprit in Alzheimer's Disease? (The Curious Wavefunction)</a>==<br /><blockquote style="font-style: italic;">Following on the heels of the headline-making Nature publication that demonstrated that NSAIDs (Non-steroidal AntiInflammatory Drugs) uniquely targeted a substrate (<a href="http://en.wikipedia.org/wiki/Amyloid_precursor_protein">APP</a>) rather than an active site of the gamma-secretase complex involved in plague formation in Alzheimer's (see <a href="http://mwclarkson.blogspot.com/2008/06/nsaids-bind-to-amyloid.html">Discount Thoughts</a> for a great summary) comes a paper that may turn out to be one of the important papers in the history of Alzheimer's disease (AD) research.</blockquote>==<a href="http://mwclarkson.blogspot.com/2008/06/nsaids-bind-to-amyloid.html">NSAIDs bind to amyloid-β (Discount thoughts)</a>==<br /><span style="font-style: italic;"></span><blockquote><span style="font-style: italic;">One of the best-known features of </span><a style="font-style: italic;" href="http://en.wikipedia.org/wiki/Alzheimer%27s_disease">Alzheimer's disease</a><span style="font-style: italic;"> pathology is the formation of proteinaceous </span><a style="font-style: italic;" href="http://en.wikipedia.org/wiki/Senile_plaques">amyloid plaques</a><span style="font-style: italic;"> in the brain. In Alzheimer's disease these plaques are primarily formed by the </span><a style="font-style: italic;" href="http://en.wikipedia.org/wiki/Amyloid_beta">amyloid-β peptide</a> (Aβ) derived from the <a style="font-style: italic;" href="http://en.wikipedia.org/wiki/Amyloid_precursor_protein">amyloid precursor protein</a><span style="font-style: italic;"> (APP) by the action of β- and </span><a style="font-style: italic;" href="http://en.wikipedia.org/wiki/Gamma_secretase">γ-secretase</a><span style="font-style: italic;">. The length of the Aβ peptide varies, but the 42-residue form (Aβ42) is more likely to form plaques and fibrils.</span></blockquote><span style="font-style: italic;"></span><div class="blogger-post-footer"><img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/26255983-4786640768077369736?l=miningdrugs.blogspot.com'/></div>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.com0