tag:blogger.com,1999:blog-262559832008-07-17T01:30:56.205+02:00Mining Drug SpaceJoerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.comBlogger123125tag:blogger.com,1999:blog-26255983.post-84513528105807069462008-06-19T19:58:00.006+02:002008-06-19T22:19:32.151+02:00OpenAstexViewer - an open structural biology viewerAs posted by <a href="http://baoilleach.blogspot.com/2008/06/jmol-gets-competition-openastexviewer.html">Noel</a> and <a href="http://depth-first.com/articles/2008/06/19/3d-molecular-visualization-on-the-web-openastexviewer">Rich</a> was the <a href="http://en.wikipedia.org/wiki/Java_%28programming_language%29">Java</a>-based AstexViewer just got <a href="http://en.wikipedia.org/wiki/LGPL">LGPL</a>ed and voila! Here it is, the <a href="http://openastexviewer.net/"><span style="font-weight: bold;">Open</span>AstexViewer</a> for structure-based drug design.<br /><br />I personally was especially interested in the electron density functionality and how easy it would be creating a view on an <a href="http://en.wikipedia.org/wiki/Enzyme">enzymatically active-site</a>.<br /><br />As example I used <a href="http://en.wikipedia.org/wiki/Lisinopril">Lisinopril</a> an <a href="http://en.wikipedia.org/wiki/Angiotensin-converting_enzyme">ACE</a> (Angiotensin-converting enzyme) inhibitor. This class is seen as one of the success stories of rational drug design, based on the structural biology of <a href="http://en.wikipedia.org/wiki/Carboxypeptidase_A">carboxypeptidase A</a> and <a href="http://en.wikipedia.org/wiki/Medicinal_chemistry">medicinal chemistry</a>.<br />My first thought was using <a href="http://en.wikipedia.org/wiki/Captopril">Captopril</a>, the initial drug for this class, but I could not find an electron density map and had to pick another example. Since the PDB structure (<a href="http://www.rcsb.org/pdb/explore.do?structureId=2C6N%0A%20%20%0APreview">2C6N</a>) of Lisinopril has a <a href="http://eds.bmc.uu.se/cgi-bin/eds/uusfs?pdbCode=2c6n">deposited electron density map</a> this was now used.<br /><br />After downloading the program and loading the structure and the map, I needed only a few mouse-clicks for selecting the ligand and creating a view on active site surface. Here is the result.<br /><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://bp1.blogger.com/_cUdwVNvWCbc/SFqqOX4QWDI/AAAAAAAAAF4/Eve32jCVFIY/s1600-h/2c6n_lisinopril.gif"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer;" src="http://bp1.blogger.com/_cUdwVNvWCbc/SFqqOX4QWDI/AAAAAAAAAF4/Eve32jCVFIY/s320/2c6n_lisinopril.gif" alt="" id="BLOGGER_PHOTO_ID_5213666682539628594" border="0" /></a>The drug is shown in the active site as sticks using atom coloring with having the zinc ion on the left in gray. The active site surface shows the lipophilicity of the protein residues. In orange we see the electron density map, which indicates a reasonable binding mode of this ligand.<br /><br />Finally, but a little off-topic, I was also curious, if <a href="http://www.simbiosys.ca/ehits_lasso/index.html">LASSO</a> on <a href="http://www.chemspider.com/">ChemSpider</a> identifies <a href="http://www.chemspider.com/Chemical-Structure.4514933.html">this molecule</a> as an ACE inhibitor? In short, yes this target class gets one of the highest scores. Nice, having some online tools around for collecting quickly drug information.<br /><br />After all of this I must say that the OpenAstexViewer is a very elegant, easy to use tool, which is extremely powerful. Has anyone tried already the scripting abilities ?<br /><br /><span style="font-weight: bold;">References</span><br /><ul><li>M. Adam, <span style="font-style: italic;">Integrating research and development: the emergence of rational drug design in the pharmaceutical industry</span>, Stud Hist Philos Biol Biomed Sci, <span style="font-weight: bold;">2005</span>, <span style="font-style: italic;">36</span>, 513-37. <a href="http://philsci-archive.pitt.edu/archive/00002397/01/Adam_rational_drug_design.pdf">PDF</a>.<br />PMID: <a href="http://www.ncbi.nlm.nih.gov/pubmed/16137601">16137601</a><br /></li><li>H.R. Corradi, S.L. Schwager, A.T. Nchinda, E.D. Sturrock, K.R. Acharya, <span style="font-style: italic;">Crystal structure of the N domain of human somatic angiotensin I-converting enzyme provides a structural basis for domain-specific inhibitor design</span>, J Mol Biol, <span style="font-weight: bold;">2006</span>, <span style="font-style: italic;">357</span>, 964-974. PMID: <a href="http://www.ncbi.nlm.nih.gov/pubmed/16476442">16476442</a></li></ul>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.comtag:blogger.com,1999:blog-26255983.post-41991990603490909392008-06-01T21:10:00.002+02:002008-06-01T23:28:05.526+02:00Science - editorial, social, or bothIt was almost two months ago that <a href="http://www.sciencebase.com/science-blog/online-science.html">David Bradley</a> asked the question (via <a href="http://www.linkedin.com/answers/technology/biotech/TCH_BIO/199495-3051673?browseIdx=0&sik=1207132602926&goback=%2Eamq">LinkedIn</a>), if science may benefit from social software? Several people responded, and especially <a href="http://www.cshblogs.org/cshprotocols/2008/04/07/web-20-in-defense-of-editors/">David Crotty</a> started a controversial discussion based on my raised points.<br /><br />First, Crotty said that 'popularity' is a terrible measure of quality. Actually, I think I agree on this! Otherwise, any popular web page would be automatically one with a higher quality. I have some <a href="http://en.wikipedia.org/wiki/Software_metric#Common_software_metrics">software engineering</a> background and would like to take this into account. I would argue that not only the access rate of web pages, but also the cross-linking character between them, or the number of errors per page would be good metrics. Was anyone following the dispute about the <a href="http://www.nature.com/nature/journal/v438/n7070/full/438900a.html">Britannica-Wikipedia</a> comparison (<a href="http://www.nature.com/nature/journal/v440/n7084/full/440582b.html">comment</a>)? Again, what should we compare? Error rate in articles, access rate, or cross-linking rate? Do we not have the same problem for journal <a href="http://en.wikipedia.org/wiki/Impact_factor">impact factors</a>? I think we have to accept that any model (or metric) will never represent the full truth about reality, but it might give us some additional rational for making decisions. Are any molecular modellers still with me? Yes, then I have a question. Do you believe that this metric discussion about <a href="http://www.ncbi.nlm.nih.gov/sites/entrez?term=docking%20scoring%20review&db=pubmed">molecular docking and scoring</a> will ever end? If you think so, what metric or benchmark data set will lead to a fair comparison? I do not think that this is possible, but I strongly believe that the metrics and benchmark data sets around will help us to improve over time. Even using wrong metrics does sometimes help improving metrics, and therefore rationality.<br /><br />Second, I said that "<span style="font-style: italic;">I cannot see how any editorial process can cope with this (<a href="http://arstechnica.com/news.ars/post/20071226-interruptions-info-overload-cost-us-economy-650-billion.html">information overload</a>) problem</span>"; he asks the question "<span style="font-style: italic;">Is he (means me with that) implying that journal editors are not 'really interested' in the papers they’re reviewing?</span>".<br />Not at all! The point I tried to make is that the editorial process will assign reviews to reviewers, which will in most cases do the reviews also in their spare-time. So, at this point it does not matter, if reviewers get assigned by editors or by any other social web system. I think my point of view is not really strong on this, but I believe that <a href="http://en.wikipedia.org/wiki/Transparency_%28humanities%29">missing transparency</a> might be a major source of mistakes of any process, e.g. a <a href="http://scienceblogs.com/pharyngula/2008/02/a_baffling_failure_of_peer_rev.php">peer-review process</a> (via <a href="http://www.spreadingscience.com/2008/04/08/old-versus-new/">spreadingscience</a>). I am not saying that transparency can avoid mistakes, but a transparent process would allow democratic voting systems, which is one type of decision making on Wikipedia. Anyway, in cases where no decision can be reached by user votings or reviewer comments, the next step will be taken by a Wikipedia <a href="http://en.wikipedia.org/wiki/Wikipedia:Mediation">mediator</a> or an editor. The major difference is that in one case the editor has to do the assignment, while in the social software model the dedicated people would come to the editor automatically. And especially in cases where no editor is in place, users would just start editing (<a href="http://en.wikipedia.org/wiki/Wikipedia:Be_bold">be bold!</a>),till one or several users get promoted as pseudo-editor.<a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://upload.wikimedia.org/wikipedia/commons/d/d4/Be_bold.png"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 177px; height: 177px;" src="http://upload.wikimedia.org/wikipedia/commons/d/d4/Be_bold.png" alt="" border="0" /></a>Third, David Crotty says that "<span style="font-style: italic;">you need a better editorial oversight, not less</span>" and he cites Rob Malda (via <a href="http://www.wired.com/entertainment/theweb/news/2007/10/slashdot10year">Wired</a>) "<span style="font-style: italic;">When you're building a system like this (means </span><a style="font-style: italic;" href="http://en.wikipedia.org/wiki/Slashdot">Slashdot</a><span style="font-style: italic;">) you're balancing the wisdom of the crowds versus the tyranny of the mob. Sometimes a crowd is really smart, but some things don't work so well by committee. Crowds work when you have a tightly knit group of people with similar interests</span> ..."<br />Well, again I agree that you need rather more information than less. Though, I would like to challenge this a little. Where is the editor getting his information from? I would assume that he will use at least some social web services or (social) cross-links in journal publications for finding other experts.<br /><br />I was never a friend of black and white views. I would conclude that<br /><ul><li>any editorial process without some social software will run into an information overload problem, and you should not do all of this alone, but share workload.<br /></li><li>any social software process without editors runs into an information overload problem, and how can you discriminate information from noise? You need experts, editors, and some people call this ontology knowledge, or the semantic web.</li><li>a combination of both, editorial expert knowledge in combination with social software is promising for creating massive knowledge networks, e.g. <a href="http://miningdrugs.blogspot.com/2008/06/wikiproteins-may-community-with-you.html">WikiProteins</a>. Finally, any technological process will fail, if the people behind it, are not dedicated enough, run into workload problems, or are hold back by unclear intellectual property situations. So, whatever you do, do it in a team and make it a transparent process. If required, put some license, copyright, intellectual property, or whatever restrictions on it, so make steps towards users and do not wait till users will adapt to your user scenario. In all cases, avoid destroying information and allow a cross-linking between information.<br />As already said on <a href="http://www.chemspider.com/blog/we-need-an-inchikey-resolver-and-we-need-it-now.html">ChemSpider</a>, "<span style="font-style: italic;">get the hell organized...</span>" and talk to each other.</li></ul>Finally, David Crotty and David Bradley, thanks for discussing this in an open-minded setup and via a social software system. I am looking forward to more news, questions, answers and controversial discussions ... at the end it is all about learning and helping people, right?Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.comtag:blogger.com,1999:blog-26255983.post-20818634314258542312008-06-01T12:32:00.002+02:002008-06-01T16:31:49.572+02:00WikiProteins - May the community be with you<a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://www.wikiprofessional.org/"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer;" src="http://bp0.blogger.com/_cUdwVNvWCbc/SEJ7Y04BCpI/AAAAAAAAAFw/EC2rfLFhozI/s320/wikiprofessional.jpg" alt="" id="BLOGGER_PHOTO_ID_5206859785634384530" border="0" /></a><blockquote>"<span style="font-style: italic;">We call on a 'million minds' to annotate a 'million concepts' and to collect facts from the literature with the reward of collaborative knowledge discovery. The system is available for beta testing at </span><a style="font-style: italic;" href="http://www.wikiprofessional.org/">wikiprofessional.org</a>" [DOI <a href="http://dx.doi.org/10.1186/gb-2008-9-5-r89">10.1186/gb-2008-9-5-r89</a>]<br /></blockquote>The author list is impressive (e.g. <a href="http://en.wikipedia.org/wiki/Jimmy_Wales">Jimmy</a>, <a href="http://en.wikipedia.org/wiki/Michael_Ashburner">Prof. Ashburner</a>), as well is the number of communities and organizations (PubMed, Google, Yahoo, <a href="http://en.wikipedia.org/wiki/Uniprot">UniProt</a>). Of course is this not a guarantee for success, but at least is it interesting that those people have started talking to each other. I guess they all want to tacke a challenging scientific problem, which is creating knowledge out of information noise. Very impressive, indeed!<br /><br />I created an account and the system looks very beta at the moment. This early release strategy is very normal, at least for Jimmy, which follows the release soon, release often paradigm of open source communities. When Jimmy released Wikia, he got also some negative feedback, because some people thought it was too early releasing the system. Anyway, this has not stopped us from founding the <a href="http://miningdrugs.blogspot.com/2008/02/yet-another-life-science-informatics-ya.html">Life Science Group</a> on Wikia, though I admit that not too many people have contributed so far. I hope that the WikiProfessional system will be able collecting enough critical brain mass for getting a good head-start.<br /><blockquote>"<span style="font-style: italic;">The first release of WikiProteins contains an embryonic version of what is intended to be developed into a fully functional WikiProfessionals in 2008 and beyond. Users are able to review their pre-constructed (recent) publication list and create their Knowlet before registration. With an increasing number of authors having curated their own Knowlet(s) in the system, creating communities of expertise and indicating their availability for comments and peer review, instant messaging and web conferencing will become available in the system.</span>" [DOI <a href="http://dx.doi.org/10.1186/gb-2008-9-5-r89">10.1186/gb-2008-9-5-r89</a>]</blockquote>The system highlights three major use cases:<br /><ul><li><span style="font-weight: bold;">Community annotation</span>: The basic principle of community annotation is that computers and experts interact in an iterative process of mining and curation.</li><li><span style="font-weight: bold;">Knowledge browsing</span>: This will allow users browsing through the concept space of interesting relationships.</li><li><span style="font-weight: bold;">Collaborative knowledge discovery </span><span style="font-style: italic;">(example)</span>: When the connections in the concept space around antimalarials and tegafur are explored further, it becomes immediately obvious how logical it would be to reason that tegafur might indeed inhibit growth of <a href="http://en.wikipedia.org/wiki/Malaria">malaria</a> parasites, at least in vitro.<br /></li></ul>May the community be with you ...<br /><br /><span style="font-weight: bold;">Reference</span><br /><ul><li><span style="font-style: italic;">Article (macmwdomchmpplbmbwmmrbb08)</span><br />Mons, B.; Ashburner, M.; Chichester, C.; van Mulligen, E.; Weeber, M.; den Dunnen, J.; van Ommen, G.; Musen, M.; Cockerill, M.; Hermjakob, H.; Mons, A.; Packer, A.; Pacheco, R.; Lewis, S.; Berkeley, A.; Melton, W.; Barris, N.; Wales, J.; Meijssen, G.; Moeller, E.; Roes, P.; Borner, K. & Bairoch, A.<br /><span style="font-style: italic;">Calling on a million minds for community annotation in WikiProteins</span><br />Genome Biol, <span style="font-weight: bold;">2008</span>, <span style="font-style: italic;">9</span>, R89. DOI <a href="http://dx.doi.org/10.1186/gb-2008-9-5-r89">10.1186/gb-2008-9-5-r89</a>. PMID <a href="http://www.ncbi.nlm.nih.gov/sites/entrez?term=18507872&db=pubmed%3E">18507872</a></li><li><a href="http://miningdrugs.blogspot.com/2007/08/six-degrees-of-drug-design-social.html">Six degrees of drug design</a>, Mining Drug Space, <span style="font-weight: bold;">2007</span>-08-17.</li><li><a href="http://miningdrugs.blogspot.com/2007/09/six-sigma-in-drug-design-old-hands-or.html">Six sigma in drug design</a>, Mining Drug Space, <span style="font-weight: bold;">2007</span>-09-15.<br /></li></ul>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.comtag:blogger.com,1999:blog-26255983.post-41589480608641001212008-05-17T22:45:00.000+02:002008-05-17T23:03:31.311+02:00Lasso, docking, and protonationThe <a href="http://www.simbiosys.ca/">SimBioSys</a> blog posted several literature references and discussions about protonation states<br /><ul><li><a href="http://www.simbiosys.ca/blog/2008/05/16/protonation-states-and-docking/">Discussion 1</a></li><li><a href="http://www.simbiosys.ca/blog/2008/05/17/correct-protonation-state-for-docking/">Discussion 2</a><br /></li></ul>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.comtag:blogger.com,1999:blog-26255983.post-17993371569760060842008-03-08T10:00:00.011+01:002008-03-10T18:43:19.444+01:00CAS numbers are not public domain, are they?<blockquote>"<span style="font-style: italic;">Work created before the existence of copyright and patent laws also form part of the <a href="http://en.wikipedia.org/wiki/Public_domain">public domain</a>. The </span><a style="font-style: italic;" href="http://en.wikipedia.org/wiki/Bible" title="">Bible</a><span style="font-style: italic;"> and the inventions of </span><a style="font-style: italic;" href="http://en.wikipedia.org/wiki/Archimedes" title="">Archimedes</a><span style="font-style: italic;"> are in the public domain. However, copyright may exist in </span><a style="font-style: italic;" href="http://en.wikipedia.org/wiki/Translation" title="">translations</a><span style="font-style: italic;"> or new formulations of this work.</span>" [<a href="http://en.wikipedia.org/wiki/Public_domain">Wikipedia</a>]<br /></blockquote>As posted by <a href="http://www.chemspider.com/blog/cas-discourages-using-scifinder-to-help-curate-wikipedia-structures-and-cas-numbers.html">Tony</a> is the <a href="http://en.wikipedia.org/wiki/Chemical_Abstracts_Service">Chemical Abstract Service</a> (CAS) discouraging using their CAS services for assigning correct CAS numbers to structures for any third party database. Wikipedia is a source of structures, which is <a href="http://en.wikipedia.org/wiki/Public_domain">public domain</a> due to its <a href="http://en.wikipedia.org/wiki/Wikipedia:Text_of_the_GNU_Free_Documentation_License">GNU FDL</a>. Still, this does not imply that any translation of structures, e.g. CAS numbers, are in the public domain, too. Honestly, this raises a serious problem for <a href="http://en.wikipedia.org/wiki/Wikipedia_talk:WikiProject_Chemistry/CAS_validation">curating CAS numbers</a> on Wikipedia and this raises indeed the question, if they should not be dropped from Wikipedia, and any other information source, at all? Is it not better having no information, than having wrong information?<br /><br />A CAS number is for me only one certain translation of a chemical structure. In this case, the only source and creator for CAS numbers is the <a href="http://en.wikipedia.org/wiki/American_Chemical_Society">American Chemical Society</a>. CAS claims that their services can not be used for curating other data sources. Does this also mean that people can not use CAS numbers from publications?<br /><blockquote>"<span style="font-style: italic;">The public domain can also be defined in contrast to </span><a style="font-style: italic;" href="http://en.wikipedia.org/wiki/Trademarks" class="mw-redirect" title="">trademarks</a><span style="font-style: italic;">. Names, logos, and other identifying marks used in commerce can be restricted as proprietary trademarks for a single business to use. Trademarks can be maintained indefinitely, but they can also lapse through disuse, negligence, or widespread misuse, and enter the public domain. It is possible, however, for a lapsed trademark to become proprietary again, leaving the public domain.</span>" [<a href="http://en.wikipedia.org/wiki/Public_domain">Wikipedia</a>]</blockquote>And does this also mean that scientists are basically not allowed publishing CAS numbers and structures in scientific publications? How can CAS numbers then be used at all, if we can not store this information?<br /><br />Many questions, and who will answer them? And, if they got answered what is the way forward for getting curated structures within the public domain? I would say (again, this time in nicer words): '<a href="http://www.chemspider.com/blog/we-need-an-inchikey-resolver-and-we-need-it-now.html">get organized scientists worldwide!</a>' If CAS can do it, we can do it? It may take longer to get the party started, but if we do not start it will never happen.<br /><blockquote>"<span style="font-style: italic;">An ideal collaborative resource would be designed for large-scale data mining, contain curated historical data, and have data standards and deposition tools that could constantly bring in data from the published literature. ... In other words, the party might take longer to get started than hoped for, but it should be worth the wait.</span>" [M. Baker, DOI <a href="http://dx.doi.org/10.1038/nrd2148">10.1038/nrd2148</a>]<br /></blockquote><span style="font-weight: bold;">See also</span><a href="http://miningdrugs.blogspot.com/2007/01/molecule-mining-review-2006.html"><br /></a><ul><li><a href="http://miningdrugs.blogspot.com/2007/01/molecule-mining-review-2006.html">Molecule mining review</a>, 2006<br /></li></ul>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.comtag:blogger.com,1999:blog-26255983.post-59567036450152818322008-02-25T19:49:00.002+01:002008-02-26T20:06:44.495+01:00Yet Another Life Science Informatics (YA-LSI) networking portalThere are yet another two new social Life Science Informatics (LSI) networking portals. In contrast to the already available <a href="http://miningdrugs.blogspot.com/2007/06/life-science-informatics-networking.html">LSI networking services</a> are those technologically very mature and offer varieties of contribution and sharing possibilities for users.<br /><ul><li>Wikia: <a href="http://lifescienceinformatics.wikia.com/">Life Science Informatics</a></li><li>Plaxo: <a href="http://lifescienceinformatics.plaxogroups.com/">Life Science Informatics</a></li></ul>Both networking groups are <span style="font-weight: bold;">looking for a stunning LSI logo</span>, so please feel free making any suggestions.Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.comtag:blogger.com,1999:blog-26255983.post-11000479983525218002008-02-17T20:07:00.004+01:002008-02-18T00:16:40.726+01:00Social search in drug design and what is Wikia<a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://www.wikia.com/wiki/Wikia"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 195px; height: 70px;" src="http://alpha.search.wikia.com/images/wikia/front-logo.png" alt="" border="0" /></a>Social search engines, another kind of <a href="http://en.wikipedia.org/wiki/Social_software">social software</a>, are trying to include user knowledge for improving search results in the future. One example is <a href="http://www.wikia.com/">Wikia</a> with an announcement of <a href="http://en.wikipedia.org/wiki/Jimmy_Wales">Jimmy Wales</a><blockquote><p><span style="font-style: italic;">Search is part of the fundamental infrastructure of the Internet. And we are making it open source.</span> </p></blockquote>In contrast to the problem described by <a href="http://www.seomoz.org/blog/why-the-vast-majority-of-seos-no-longer-target-individual-search-engines">randfish</a> might this <a href="http://www.mattcutts.com/blog/the-role-of-humans-in-google-search/">improve search quality</a> and reduce search engine tweaking requirements by users. Sometimes users have e.g. to multiply search terms or to find other <a href="http://www.informit.com/articles/printerfriendly.aspx?p=675528">tweaks</a>, for getting the search results you want. Early initiatives like the the <a href="http://www.dmoz.org/">open directory project DMOZ</a> (<a href="http://en.wikipedia.org/wiki/Open_Directory_Project">@Wikipedia</a>) are using already user organized classification schemes for web links. <a href="http://swik.net/">SWiK</a> organizes open source project information in a Wiki-like manner.I am still a little bit puzzled to understand what the actual difference to <a href="http://en.wikipedia.org/wiki/Social_bookmarking">social bookmarking</a> systems is. Are social bookmarking services not already ranking the relevance of web links?Lets do a test on some <a href="http://en.wikipedia.org/wiki/Drug_design">drug design</a> search terms. I added the tags E(experts/users), R(anking), M(ini information page). Here a '+' indicates that this feature is available a '-' means that this feature is disabled at the moment (e.g. in Wikia) or not available (yet).<div><table><tbody><tr><td><b>Term</b></td><td><a href="http://wikia.com/"><b>Wikia (+-E,+-R)</b></a></td><td><a href="http://del.ico.us/"><b>del.ico.us (+E,-R)</b></a></td><td><a href="http://www.blogger.com/post-create.g?blogID=26255983"><b>Digg (+E,+R)</b></a></td><td><a href="http://www.blogger.com/post-create.g?blogID=26255983"><b>Technorati (+E,+R)</b></a></td><td><a href="http://www.blogger.com/post-create.g?blogID=26255983"><b>CiteULike (+E,+R)</b></a></td></tr><tr><td><a href="http://en.wikipedia.org/wiki/Data_mining">Data mining</a></td><td><a href="http://re.search.wikia.com/search#Data%20mining">36247-M</a></td><td><a href="http://del.ico.us/tag/Data+mining">yes</a></td><td><a href="http://www.digg.com/search?s=Data+mining&submit=Search&section=all&type=both&area=all&sort=most">615</a></td><td><a href="http://technorati.com/search/Data+mining?authority=a4&language=en">14630</a></td><td><a href="http://www.citeulike.org/search/all?q=Data+mining">496</a></td></tr><tr><td><a href="http://en.wikipedia.org/wiki/Drug_design">Drug design</a></td><td><a href="http://re.search.wikia.com/search#Drug%20design">34964-M</a></td><td><a href="http://del.ico.us/tag/Drug+design">yes</a></td><td><a href="http://www.digg.com/search?s=Drug+design&submit=Search&section=all&type=both&area=all&sort=most">285</a></td><td><a href="http://technorati.com/search/Drug+design?authority=a4&language=en">14966</a></td><td><a href="http://www.citeulike.org/search/all?q=Drug+design">493</a></td></tr><tr><td><a href="http://en.wikipedia.org/wiki/Bioinformatics">Bioinformatics</a></td><td><a href="http://re.search.wikia.com/search#Bioinformatics">16937+M</a></td><td><a href="http://del.ico.us/tag/Bioinformatics">yes</a></td><td><a href="http://www.digg.com/search?s=Bioinformatics&submit=Search&section=all&type=both&area=all&sort=most">75</a></td><td><a href="http://technorati.com/search/Bioinformatics?authority=a4&language=en">4698</a></td><td><a href="http://www.citeulike.org/search/all?q=Bioinformatics">496</a></td></tr><tr><td><a href="http://en.wikipedia.org/wiki/Structural_biology">Structural biology</a></td><td><a href="http://re.search.wikia.com/search#Structural%20biology">7925-M</a></td><td><a href="http://del.ico.us/tag/Structural+biology">yes</a></td><td><a href="http://www.digg.com/search?s=Structural+biology&submit=Search&section=all&type=both&area=all&sort=most">60</a></td><td><a href="http://technorati.com/search/Structural+biology?authority=a4&language=en">1758</a></td><td><a href="http://www.citeulike.org/search/all?q=Structural+biology">499</a></td></tr><tr><td><a href="http://en.wikipedia.org/wiki/Organic_synthesis">Organic synthesis</a></td><td><a href="http://re.search.wikia.com/search#Organic%20synthesis">4403-M</a></td><td><a href="http://del.ico.us/tag/Organic+synthesis">yes</a></td><td><a href="http://www.digg.com/search?s=Organic+synthesis&submit=Search&section=all&type=both&area=all&sort=most">15</a></td><td><a href="http://technorati.com/search/Organic+synthesis?authority=a4&language=en">1979</a></td><td><a href="http://www.citeulike.org/search/all?q=Organic+synthesis">498</a></td></tr><tr><td><a href="http://en.wikipedia.org/wiki/Medicinal_chemistry">Medicinal chemistry</a></td><td><a href="http://re.search.wikia.com/search#Medicinal%20chemistry">3216-M</a></td><td><a href="http://del.ico.us/tag/Medicinal+chemistry">yes</a></td><td><a href="http://www.digg.com/search?s=Medicinal+chemistry&submit=Search&section=all&type=both&area=all&sort=most">45</a></td><td><a href="http://technorati.com/search/Medicinal+chemistry?authority=a4&language=en">1686</a></td><td><a href="http://www.citeulike.org/search/all?q=Medicinal+chemistry">498</a></td></tr><tr><td><a href="http://en.wikipedia.org/wiki/Kinase">Kinase</a></td><td><a href="http://re.search.wikia.com/search#Kinase">2843-E-M</a></td><td><a href="http://del.ico.us/tag/Kinase">yes</a></td><td><a href="http://www.digg.com/search?s=Kinase&submit=Search&section=all&type=both&area=all&sort=most">30</a></td><td><a href="http://technorati.com/search/Kinase?authority=a4&language=en">1248</a></td><td><a href="http://www.citeulike.org/search/all?q=Kinase">498</a></td></tr><tr><td><a href="http://en.wikipedia.org/wiki/Molecular_modelling">Molecular modelling</a></td><td><a href="http://re.search.wikia.com/search#Molecular%20modelling">2261-M</a></td><td><a href="http://del.ico.us/tag/Molecular+modelling">yes</a></td><td><a href="http://www.digg.com/search?s=Molecular+modelling&submit=Search&section=all&type=both&area=all&sort=most">30</a></td><td><a href="http://technorati.com/search/Molecular+modelling?authority=a4&language=en">208</a></td><td><a href="http://www.citeulike.org/search/all?q=Molecular+modelling">491</a></td></tr><tr><td><a href="http://en.wikipedia.org/wiki/Cheminformatics">Cheminformatics</a></td><td><a href="http://re.search.wikia.com/search#Cheminformatics">460+M</a></td><td><a href="http://del.ico.us/tag/cheminformatics">yes</a></td><td><a href="http://www.digg.com/search?s=Cheminformatics&submit=Search&section=all&type=both&area=all&sort=most">1</a></td><td><a href="http://technorati.com/search/Cheminformatics?authority=a4&language=en">147</a></td><td><a href="http://www.citeulike.org/search/all?q=Cheminformatics">60</a></td></tr><tr><td><a href="http://en.wikipedia.org/wiki/GPCR">GPCR</a></td><td><a href="http://re.search.wikia.com/search#GPCR">252-E-M</a></td><td><a href="http://del.ico.us/tag/GPCR">yes</a></td><td><a href="http://www.digg.com/search?s=GPCR&submit=Search&section=all&type=both&area=all&sort=most">1</a></td><td><a href="http://technorati.com/search/GPCR?authority=a4&language=en">121</a></td><td><a href="http://www.citeulike.org/search/all?q=GPCR">497</a></td></tr></tbody></table></div>Wikia is still an <a href="http://en.wikipedia.org/wiki/Software_release_life_cycle">alpha version</a> and user ranking of links seems to be somehow available, but does not work (yet). Though, I like the general appearance and hope to see there some new features soon. Using del.ico.us as search engine is just useless in my opinion, because I have not found any way ranking the results. Digg might be in theory useful, but lacks scientific content. Technorati is rather a news alerting service than a search engine. CiteULlike points towards scientific articles, but why are all categories delivering less than 500 hits here?Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.comtag:blogger.com,1999:blog-26255983.post-19116649002685377842008-02-11T00:26:00.000+01:002008-02-11T00:59:44.072+01:00Pipeline pilot: Student edition<a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://www.accelrys.com/products/scitegic/pp-student/academic-web-banner.jpg"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 427px; height: 204px;" src="http://www.accelrys.com/products/scitegic/pp-student/academic-web-banner.jpg" alt="" border="0" /></a>As posted by <a href="http://ccl.net/cgi-bin/ccl/message-new?2008+02+08+003">Richard Compton</a> on <a href="http://ccl.net/">CCL.net</a> provides <a href="http://www.scitegic.com/">SciTegic</a> (part of Accelrys) a <a href="http://www.accelrys.com/products/scitegic/pp-student/">student version</a> of Pipeline Pilot (PLP), which is free for academics ! I seriously like the thinking of those SciTegic guys, they are sharp, critical, and very trendy. This is a good starting point for having a hell of a discussion about science. Smart move, which is following the integration principle !<br /><br />This shows another fantastic example of a smart licensing strategy, which will probably always be critically discussed in the drug design area? Just recently there was a discussion on CCL and I <a href="http://ccl.net/cgi-bin/ccl/message-new?2008+02+04+010">commented on it</a>. Anyway, this particular case will support science and allows scientists integrating their tools on a global level. Furthermore it allows sharing tools with industrial partner without forcing them creating an additional interface layer.<br /><br />Beside, is this really kicking-ass news for everybody involved in <a href="http://www.knime.org/">KNIME</a>, since people in academia can now support PLP directly (if they want). KNIME provides already lots of toolboxes, but compared to PLP this number is still small. Just check them out and especially the high number of different ways of integrating data, processing data, and integrating external tools with different languages.<br /><ul><li><a href="http://www.scitegic.com/products/chemistry/index.html">Chemistry</a></li><li><a href="http://www.scitegic.com/products/ADMET/index.html">ADME/Tox</a></li><li><a href="http://www.scitegic.com/products/chemmining/">ChemMining<span class="new"></span></a></li><li><a href="http://www.scitegic.com/products/DT/index.html">Decision Trees </a></li><li><a href="http://www.scitegic.com/products/gene_expression/">Gene Expression<span class="new"></span></a></li><li><a href="http://www.scitegic.com/products/materials_studio/">Materials<span class="new"></span></a></li><li><a href="http://www.scitegic.com/products/modeling/index.html">Modeling</a></li><li><a href="http://www.scitegic.com/products/rstatistics/index.html">R Statistics</a></li><li><a href="http://www.scitegic.com/products/reporting/index.html">Reporting</a></li><li><a href="http://www.scitegic.com/products/imaging/">Imaging Collection<span class="new"></span></a></li><li><a href="http://www.scitegic.com/products/SAC/index.html">Sequence Analysis</a></li><li><a href="http://www.scitegic.com/products/TAC/index.html">Text Analytics</a></li><li><a href="http://www.scitegic.com/products/catalyst/">Catalyst</a></li><li><a href="http://www.scitegic.com/products/charmm/">CHARMm</a></li><li><a href="http://www.scitegic.com/products/integration/index.html">Integration Collection</a></li><li><a href="http://www.scitegic.com/products/sdks/index.html">Client-Side SDKs</a><br /></li><li><a href="http://www.scitegic.com/products/webport/index.html">Pipeline Pilot Web Port</a></li></ul>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.comtag:blogger.com,1999:blog-26255983.post-73185585395635212192008-02-04T23:38:00.000+01:002008-02-09T01:23:36.895+01:00Chemical webs, nets, and spidersI love it, when people come together and try to understand why particular problems exist. In <a href="http://miningdrugs.blogspot.com/2007/09/social-chemistry-and-wikipedia.html">September 2007</a> we tried contacting several people for solving the problem of having proper chemical indexing or/and search services within Wikipedia. Some people replied, but there was still this strange feeling that there is not only one responsible person, but many. How can you adress a problem if nobody is really in charge and how to take decisions in such situations? Well, maybe spreading the news had some effect, because some things happened since then.<br /><ol><li><a href="http://www.linkedin.com/in/antonywilliams">Tony</a> (aka <a href="http://en.wikipedia.org/wiki/User:ChemSpiderMan">ChemSpiderMan</a>) had a discussion with <a href="http://en.wikipedia.org/wiki/User:Walkerma">Walkerma</a> from the chemistry group on Wikipedia. It turned out that both share some common interests, which is providing <a href="http://www.chemconnector.com/chemunicating/dedicating-christmas-time-to-the-cause-of-curating-wikipedia.html">curated chemical data</a>. This is a noble goal and <a href="http://www.chemspider.com/docs/wikipedia/Wikpiedia_Review_project_010808.pdf">as summarized</a> not an easy one, especially not because there exist already some <a href="http://en.wikipedia.org/wiki/Wikipedia:Chemical_infobox/Workshop">chemistry boxes</a> at Wikipedia.<br /></li><li>Tony and some other people from the advisory board are showing interests in creating <a href="http://www.chemspider.com/blog/wichempedia-is-now-on-its-way.html">WiChempedia</a>, which has also one major goal, "<span style="font-style: italic;">what can be done to facilitate delivery of quality information to the chemistry community</span>?".</li><li><a href="http://en.wikipedia.org/wiki/User:Walkerma">Walkerma</a> initiated <a href="http://www.chemspider.com/blog/a-request-from-martin-walker-at-wikipedia.html">a regular discussion</a> on <a href="http://en.wikipedia.org/wiki/Wikipedia:WikiProject_Chemistry/IRC_discussions">Wikipedia (via IRC)</a> for helping to understand the technical and scientific issues for incorrect chemical information on Wikipedia. Though I must say: I do not like the meeting time, which is during business hours and not practical for me.<br /></li><li>Tony is not only moving on with extending his net of dedicated persons, but also attracting new <a href="http://www.chemspider.com/blog/chemspider-becomes-a-connector-to-chemical-companies.html">chemical sources</a>, and spreading <a href="http://www.iupac.org/publications/ci/2008/3001/ic_chemspider.html">publications</a>. Looks like he is a professional advocat and knows what he is doing.<br /></li><li>Beside all of this has Chemspider itself updated and extended his <a href="http://www.chemspider.com/blog/the-progress-of-structure-deposition-on-chemspider.html">structure deposition</a> process (<a href="http://www.chemspider.com/docs/Single_Structure_Depositions_on_ChemSpider.pdf">PDF slides</a>), which allow people curating data there directly.<br /></li></ol>I am usually not easily impressed, but I must say that what was done so far is a great piece of work. I do not know what others think about it, but knowing the pros and cons of social networks, it is fun to see that ChemSpider is moving with its net, and not being hold back by it !Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.comtag:blogger.com,1999:blog-26255983.post-3713396038769227582008-01-06T01:09:00.000+01:002008-01-06T21:20:01.437+01:00ChemSpider search on iGoogleExtending the <a href="http://miningdrugs.blogspot.com/2007/01/emolecules-and-leo-searches-on-google.html">PubChem and eMolecules</a> gadgets, I created a <a href="http://www.joergkurtwegner.de/tools/chemspider_searchbar.xml">simple interface</a> for a basic <a href="http://www.chemspider.com/">ChemSpider</a> interface for <a href="http://en.wikipedia.org/wiki/IGoogle">iGoogle</a>.<br /><script src="http://gmodules.com/ig/ifr?url=http://www.joergkurtwegner.de/tools/chemspider_searchbar.xml&up_resultdisplay=_parent&synd=open&w=320&h=30&title=ChemSpider+search&border=%23ffffff%7C0px%2C1px+solid+%23004488%7C0px%2C1px+solid+%23005599%7C0px%2C1px+solid+%230077BB%7C0px%2C1px+solid+%230088CC&output=js"></script><br /><br />I noticed a few issues and hope that those might get fixed<br /><ul><li>The <a href="http://www.iupac.org/inchi/release102.html">InChiKey</a> search on ChemSpider does not support searching only for the structure without stereochemical information, which would be just the first part of the InChIKey: structure-stereoinfo. So, I hope his could get fixed.<br /></li><li>I tried finding more search options, e.g. defining substructure or exact searches, but could not find them in the ChemSpider tutorial. If there is more detailed information around I would appreciate getting some hints.</li></ul>And just for clarity, here is a list of the other gadgets I have created<br /><script src="http://gmodules.com/ig/ifr?url=http://www.joergkurtwegner.de/tools/doi_searchbar.xml&up_resultdisplay=_parent&synd=open&w=320&h=30&title=DOI+search&border=%23ffffff%7C0px%2C1px+solid+%23004488%7C0px%2C1px+solid+%23005599%7C0px%2C1px+solid+%230077BB%7C0px%2C1px+solid+%230088CC&output=js"></script><br /><script src="http://gmodules.com/ig/ifr?url=http://www.joergkurtwegner.de/tools/emolecules_searchbar.xml&up_searchtype=ss&up_resultdisplay=_parent&synd=open&w=320&h=30&title=eMolecules+search&border=%23ffffff%7C0px%2C1px+solid+%23004488%7C0px%2C1px+solid+%23005599%7C0px%2C1px+solid+%230077BB%7C0px%2C1px+solid+%230088CC&output=js"></script><br /><script src="http://gmodules.com/ig/ifr?url=http://www.joergkurtwegner.de/tools/leo_searchbar.xml&up_lp=ende&up_resultdisplay=_parent&synd=open&w=320&h=50&title=LEO+search&border=%23ffffff%7C0px%2C1px+solid+%23004488%7C0px%2C1px+solid+%23005599%7C0px%2C1px+solid+%230077BB%7C0px%2C1px+solid+%230088CC&output=js"></script><br /><script src="http://gmodules.com/ig/ifr?url=http://www.joergkurtwegner.de/tools/ncbi_searchbar.xml&up_resultdisplay=_parent&synd=open&w=320&h=50&title=NCBI+search&border=%23ffffff%7C0px%2C1px+solid+%23004488%7C0px%2C1px+solid+%23005599%7C0px%2C1px+solid+%230077BB%7C0px%2C1px+solid+%230088CC&output=js"></script>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.comtag:blogger.com,1999:blog-26255983.post-42461496349490762162008-01-05T22:15:00.000+01:002008-01-06T19:41:36.038+01:00Boutique drugs: Splitting mass market and drug targets<blockquote>"<span style="font-style: italic;">Recently, the term boutique has started being applied to normally-mass-market items that are either niche or produced in intentionally small numbers at very high prices. For example, before the release of the </span><a style="font-style: italic;" href="http://en.wikipedia.org/wiki/Wii" title="Wii">Wii</a><span style="font-style: italic;">, a </span><a style="font-style: italic;" href="http://en.wikipedia.org/wiki/Time_Magazine" title="Time Magazine">Time Magazine</a><span style="font-style: italic;"> article suggested that </span><a style="font-style: italic;" href="http://en.wikipedia.org/wiki/Nintendo" title="Nintendo">Nintendo</a><span style="font-style: italic;"> could become a boutique video-game company, producing games for niche audiences, rather than trying to compete directly with </span><a style="font-style: italic;" href="http://en.wikipedia.org/wiki/Microsoft" title="Microsoft">Microsoft</a><span style="font-style: italic;"> and </span><a style="font-style: italic;" href="http://en.wikipedia.org/wiki/Sony" title="">Sony</a><span style="font-style: italic;"> </span><a style="font-style: italic;" href="http://www.time.com/time/magazine/article/0,9171,501031215-557120,00.html" class="external autonumber" title="http://www.time.com/time/magazine/article/0,9171,501031215-557120,00.html" rel="nofollow">[1]</a>" [<a href="http://en.wikipedia.org/wiki/Boutique">Boutique@WP</a>]<br /><br />"<span style="font-style: italic;">articles intended for use in the diagnosis, cure, mitigation, treatment, or prevention of disease in man or other animals and articles (other than food)</span><span style="font-style: italic;"> intended to affect the structure or any function of the body of man or other animals.</span>" [<a href="http://www.fda.gov/opacom/laws/fdcact/fdcact1.htm">FDA definition</a>]<br /></blockquote>As found via <a href="http://www.bio-itworld.com/science-technology/drug-discovery">Bio-IT/drug discovery</a> expect analysts in the future more <a href="http://www.washingtonpost.com/wp-dyn/content/article/2007/12/30/AR2007123001043.html">boutique drugs</a>. Two main causes are mentioned.<br /><br />First, the regulatory hurdles for getting a drug on the market have dramatically hardened since 2004. Furthermore has the number of <a href="http://en.wikipedia.org/wiki/Black_box_warning">black-box warnings</a> (<a href="http://www.fda.gov/cber/gdlns/boxwarlb.htm">FDA guidance</a>) at least doubled since that time.<br />Second, is there still a <a href="http://miningdrugs.blogspot.com/2007/11/commenting-on-grim-days-for-pharma.html">dwindling pipeline</a> for new medications.<br /><br />And I think Garnier and Brozak have the points when saying that it is difficult to react in a short time to many new drug projects/targets:<br /><blockquote>"<span style="font-style: italic;">The hurdle has been raised, there's no question about it! What is unclear is exactly how we should modify our drug development plans to meet this higher hurdle.</span>" [J. P. Garnier]<br />"<span style="font-style: italic;">No pharmaceutical executive has ever lost his job for saying no to a new drug project. And no FDA employee has ever lost his job for saying no to a new drug application.</span>" [S. Brozak]<br /></blockquote>Anyway, with a recent sad history for scientists it is to expect that <a href="http://miningdrugs.blogspot.com/2007/11/commenting-on-grim-days-for-pharma.html">drug design faces additional challenges</a> in the next years, which are: <a href="http://en.wikipedia.org/wiki/Paradigm">changing drug design pradigms</a>, avoiding <a href="http://en.wikipedia.org/wiki/Paralysis">paralysis</a>.<br /><br /><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://bp1.blogger.com/_cUdwVNvWCbc/R4AINWTW2hI/AAAAAAAAAFQ/_54G4lZRuAU/s1600-h/drug_design_paralysis.gif"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer;" src="http://bp1.blogger.com/_cUdwVNvWCbc/R4AINWTW2hI/AAAAAAAAAFQ/_54G4lZRuAU/s200/drug_design_paralysis.gif" alt="" id="BLOGGER_PHOTO_ID_5152126999129938450" border="0" /></a>I think it is possible, but I must say that <a href="http://miningdrugs.blogspot.com/2007/11/andy-grove-when-silicon-indoctrinates.html">some ideas</a> are coming from people knowing the business, but not the science! On the other hand do "scientists, taken as a class, not always turn into the best managers" (via <a href="http://pipeline.corante.com/archives/2008/01/03/top_of_the_heap.php">in the pipeline</a>). Especially in this setup, changing seems difficult, because many are used to an established "blockbuster" drug design setup, which has worked for decades. Now, they have to define a way-out facing an <a href="http://miningdrugs.blogspot.com/2007/05/there-are-too-many-silos-and-often.html">innovaton funnel</a> problem as well. If some of us scientists survive the next years, we can tell our grandchilds how some people and companies were able creating <a href="http://miningdrugs.blogspot.com/2006/10/disruptive-technology-in-drug-design.html">disruptive innovation</a> helping patients in the upcoming decades.<span style="font-weight: bold;"><br /><br />References</span><br /><ul><li>Flat, round pills <a href="http://en.wikipedia.org/wiki/Image:FlattenedRoundPills.jpg">@WP</a></li><li>No smoking sign <a href="http://en.wikipedia.org/wiki/Image:No_smoking_sign.svg">@WP</a><br /></li></ul>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.comtag:blogger.com,1999:blog-26255983.post-61191081518512140582007-12-22T19:30:00.000+01:002007-12-24T12:44:13.324+01:00A community moves on, merry XMas and fantastic New Year 2008As posted by <a href="http://mndoci.com/blog/2007/12/21/the-programmable-scientific-web/">Deepak</a> and <a href="http://chem-bla-ics.blogspot.com/2007/12/christmas-presents.html">Egon</a> have several chem-/bio-informatics community members reached another milestone by publishing about <a href="http://dx.doi.org/10.1186/1471-2105-8-487">userscripts in life sciences</a>. In short, I am a believer in this <a href="http://miningdrugs.blogspot.com/2007/08/six-degrees-of-drug-design-social.html">technology</a>. When reading, I got a little bit disappointed, because I got left with the feeling that I start loosing the touch to the community, though I try to follow and <a href="http://userscripts.org/scripts/review/6807">contribute</a> as much as possible.<br /><br />Loosing grip became also clear when I read the posts of <a href="http://bioclipse.blogspot.com/2007/12/towards-standards-for-qsar-data-setup.html">Ola</a> and <a href="http://chem-bla-ics.blogspot.com/2007/12/molecular-qsar-descriptors-in-cdk.html">Egon</a>. I know that the community needs a <a href="http://depth-first.com/articles/2007/09/05/name-that-graph-revealed-oligarchy-2-0">self-selecting oligarchy</a> for moving forward. You guys meet regularly in workshops and user group meetings. Some people might not have the time for this. In other words, you are part of the hard-core development team. This means, if you are not explaining the problems in more details and post more information somewhere, you might exclude some people, which do not have the time anymore digging trough the code themself.<br /><br />As said earlier, I am working now in <a href="http://en.wikipedia.org/wiki/Drug_design">drug design</a> with <a href="http://pipeline.corante.com/">all the good and bad things</a> about it (just read a few of Derek's posts and comments). This means my focus lies now on <a href="http://en.wikipedia.org/wiki/Lead_compound">lead optimization</a>, <a href="http://miningdrugs.blogspot.com/2007/02/anti-hiv-drug-prezista-darunavir-45.html">reading a lot</a>, <a href="http://miningdrugs.blogspot.com/2007/05/literature-management.html">literature management</a>, and <a href="http://en.wikipedia.org/wiki/Data_mining">data mining</a>/<a href="http://miningdrugs.blogspot.com/2007/05/recent-advances-in-chemoinformatics.html">graph mining</a>, and only partially source code (if I can not call any support). This just means that I am very busy with many things, some of them I will and can not talk about, and this is good for my company community. As said before, data is <a href="http://miningdrugs.blogspot.com/2007/10/open-data-closed-data-or-question-about.html">not closed in general</a>, but people have to accept some policies and confidentiality agreements. If you really want it, you have to talk to the right people and you need to have some very convincing arguments why you need this data.<br /><br />While talking about policies, congratulations to Antony <a href="http://www.chemspider.com/blog/?p=312">driving collaborations</a> on several levels.<br /><br />If there is something which helps people like me, for finding relevant information quickly, then that are two things:<br /><ul><li>userscripts, or in general social software, helping me finding information quickly</li><li>open standard databases, which allow people to contribute on several levels. Please note that I still think that open standards are more important than open access. 'Open' is for people in industry just a question of pricing, though I must say that some pricing strategies are just ridiculous and couterproductive for the work of scientists.<br /></li></ul><br /><span style="font-weight: bold;">References</span><br /><ul><li>E. L. Willighagen, N. M. O'Boyle, H. Gopalakrishnan, D. Jiao, R. Guha, C. Steinbeck, D. J. Wild, <span style="font-style: italic;">Userscripts for the Life Sciences</span>, Bioinformatics, <span style="font-weight: bold;">2007</span>, <span style="font-style: italic;">8</span>, 487.<br />DOI <a href="http://dx.doi.org/10.1186/1471-2105-8-487">10.1186/1471-2105-8-487</a><br /><br /></li></ul>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.comtag:blogger.com,1999:blog-26255983.post-67489035695238783322007-12-02T22:27:00.000+01:002007-12-02T22:37:19.588+01:00Too many candles - World AIDS day 2007<blockquote>"<span style="font-style: italic;">Although treatments for AIDS and HIV exist to decelerate the virus' progression, there is currently no known cure.</span>" [<a href="http://en.wikipedia.org/wiki/AIDS">AIDS@Wikipedia</a>]</blockquote>Help fighting AIDS !<br /><object height="255" width="325"><param name="movie" value="http://www.youtube.com/v/vGcnc2tgjZI&rel=1"><param name="wmode" value="transparent"><embed src="http://www.youtube.com/v/vGcnc2tgjZI&rel=1" type="application/x-shockwave-flash" wmode="transparent" height="255" width="325"></embed></object><br />source: <a href="http://www.amfar.org/">amfAR</a>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.comtag:blogger.com,1999:blog-26255983.post-33476101880288917512007-11-25T18:11:00.000+01:002007-11-25T18:16:15.742+01:00MetWare: metabolomics databasevia <a href="http://chem-bla-ics.blogspot.com/2007/11/metware-metabolomics-database-project.html">chem-bla-ics</a><br /><blockquote style="font-style: italic;">The Applied Bioinformatics at <a href="http://www.pri.wur.nl/NL/">PRI</a> group in <a href="http://en.wikipedia.org/wiki/Wageningen">Wageningen</a> and the group of <a href="http://www.ipb-halle.de/de/forschung/stress-und-entwicklungsbiologie/forschungsgruppen/bioinformatik-massenspektrometrie/">Steffen Neumann</a> in Halle have started the <a href="http://metware.sf.net/">MetWare</a> project on <a href="http://sf.net/">Sourceforge</a> to develop opensource databases for metabolomics data.<br /><br />The databases design will be based on and ideally compatible with proposed standards like ArMet (DOI:<a href="http://dx.doi.org/10.1038/nbt1041">10.1038/nbt1041</a>) and those recently written up by the <a href="http://msi-workgroups.sourceforge.net/">Metabolomics Standards Initiative</a> (see the issue around DOI:<a href="http://dx.doi.org/10.1007/s11306-007-0070-6">10.1007/s11306-007-0070-6</a>).<br /><br />One important design goal is that the project will use <a href="http://www.biomart.org/">BioMart</a>, which will allow easy integration of the database content in data analysis programs like <a href="http://taverna.sf.net/">Taverna</a> and <a href="http://www.r-project.org/">R</a> using the <a href="http://www.bioconductor.org/packages/2.1/bioc/html/biomaRt.html">biomaRt</a> package (see DOI:<a href="http://dx.doi.org/10.1093/bioinformatics/bti525">10.1093/bioinformatics/bti525</a>).</blockquote>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.comtag:blogger.com,1999:blog-26255983.post-14143882269624513862007-11-08T23:14:00.000+01:002007-11-08T23:29:14.283+01:00Once upon a cyclopentyl a cyclohexyl followed, and they lived not happily ever after ...via <a href="http://ashutoshchemist.blogspot.com/2007/11/what-doth-ch2.html">The Curious Wavefunction</a><br /><blockquote style="font-style: italic;">One common strategy used by medicinal chemists for enhancing potency is to add a methylene carbon in a ring that's fitting into a lipophilic pocket. A methylene group usually contributes about 4 kJ/mol of affinity....or maybe not.<br /><br /><a href="http://pc1664.pharmazie.uni-marburg.de/">Gerhard Klebe's</a> group at Marburg are studying inhibitors of thrombin in which they replace a cyclopentyl group by a cyclohexyl group. To their surprise they <a href="http://www3.interscience.wiley.com/cgi-bin/abstract/116324543/ABSTRACT">observe</a> (DOI: <a href="http://dx.doi.org/10.1002/anie.200701169">10.1002/anie.200701169</a>) no change in binding affinity. Crystallography indicates no electron density for the part occupied by the cyclohexyl group but robust density for the cyclopentyl. The authors conclude that while the cyclohexyl binding is enthalpically unfavourable, entropically the six-membered ring can flip and twist and dance, which is favourable, and also does not provide much crystal density because of its flexibility. So unfavourable enthalpy is balanced by favourable entropy. MD simulations support this contention.<blockquote>"What lesson can be learned from this example? Usually ligands are optimized in congeneric series. The addition of functional groups is expected to enhance binding; for example, the change from a five- to a six-membered ring should augment binding affinity by approximately 3–4 kJ/mol. However, even very similar ligands can exhibit very different binding properties that destroy a simple structure–activity relationship"</blockquote>Also, the gain in binding affinity from a methylene depends on the system. Model systems based on octanol-water partitioning offer a value of about 0.7 kcal/mol. But of course octanol unlike a protein does not have a problem reorienting itself around the substrate. Thus, the value can be less for a protein. On the other hand, in cases where an extra methylene or methyl just about fills the remaining space in a pocket and packs tightly can provide 5 times as much binding affinity. This is true for example in case of tRNA synthatases (I got this from Alan Fersht's excellent <a href="http://www.amazon.com/Structure-Mechanism-Protein-Science-Catalysis/dp/0716732688/ref=pd_bbs_sr_1/102-4750131-6367354?ie=UTF8&s=books&qid=1194295858&sr=1-1">book</a>) As usual, exceptions abound to the generalisations, and Klebe's study provides another interesting example.</blockquote><span style="font-weight: bold;">References</span><br /><ul><li>Cyclohexyl <span class="chem:smiles">C1CCCCC1<span></span></span></li><li>Cyclopentyl <span class="chem:smiles">C1CCCC1</span></li><li><span class="chem:smiles">C. Gerlach, M. Smolinski, H. Steuber, C. A. Sotriffer, A. Heine, D. G. Hangauer, G. Klebe, <span style="font-style: italic;">Thermodynamic Inhibition Profile of a Cyclopentyl and a Cyclohexyl Derivative towards Thrombin: The Same but for Different Reasons</span>, Angewandte Chemie International Edition, <span style="font-style: italic;">46</span>,8511-8514, <span style="font-weight: bold;">2007</span>. DOI <a href="http://dx.doi.org/10.1002/anie.200701169">10.1002/anie.200701169</a><br /><br /></span></li></ul>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.comtag:blogger.com,1999:blog-26255983.post-63148066968600267612007-11-07T22:11:00.000+01:002007-11-07T22:25:07.867+01:00Andy Grove: When silicon indoctrinates carbon<a href="http://www.newsweek.com/id/68221">Andy Grove</a> is right and we, the drug design community, can learn something from the chip industry. <a href="http://pipeline.corante.com/archives/2007/11/06/andy_grove_rich_famous_smart_and_wrong.php">Derek, can you not see it, too?</a> This strong relationship between those two families ...<a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://upload.wikimedia.org/wikipedia/commons/6/64/Electronic_component_transistors.jpg"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 200px;" src="http://upload.wikimedia.org/wikipedia/commons/6/64/Electronic_component_transistors.jpg" alt="" border="0" /></a><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://upload.wikimedia.org/wikipedia/commons/2/28/Expecting_family.jpg"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 200px;" src="http://upload.wikimedia.org/wikipedia/commons/2/28/Expecting_family.jpg" alt="" border="0" /></a><blockquote>"<span style="font-style: italic;">If you want to understand why something happens in business, study the disk drive industry. Those companies are the closest things to fruit flies that the business world will ever see. Drug design is a process between 9 to 15 years! So, which object to study lies in-between a fruit fly and a hard-disk? A high-throughput screening, a biological assay or an 'in silico' 3D/2D/xD model of a drug?</span>" [<a href="http://miningdrugs.blogspot.com/2006/10/disruptive-technology-in-drug-design.html">Mining Drug Space</a>]<br /></blockquote>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.comtag:blogger.com,1999:blog-26255983.post-44735493502558351662007-11-03T02:35:00.000+01:002007-11-03T10:40:31.891+01:00Commenting on: Grim days for pharmavia <a href="http://mndoci.com/blog/2007/11/02/grim-days-for-pharma/">business|bytes|genes|molecules</a><br /><p></p><blockquote style="font-style: italic;"><p>Bloomberg says that drug sales are forecast to grow at their <a href="http://www.bloomberg.com/apps/news?pid=20601202&sid=aImshDeG9tDs&refer=healthcare">slowest ever rate</a> in 44 years.</p> <p>The <a href="http://en.wikipedia.org/wiki/The_Wall_Street_Journal">WSJ</a> reminds us that <a href="http://blogs.wsj.com/health/2007/11/01/drug-drought-deepens-as-fda-approvals-lag/">FDA approvals for NMEs are slow</a>, and on a downward curve.</p> <p>Perhaps it’s time for the pharma industry to really think about <a href="http://mndoci.com/blog/2007/10/29/trendspotting-the-future-of-biopharma/">Pharma Futorology</a></p></blockquote>Some of <a href="http://pipeline.corante.com/archives/2007/10/24/come_on_improve_already.php">the consequences</a> have been announced, and I think everybody hopes that those things will not happen every single year!<br /><br />Can innovation help us for making drug design a <a href="http://www.blogger.com/dx.doi.org/10.1038/nbt0807-945">mission possible</a>? If yes, what are potential key factors for success?<br /><ul><li><a href="http://miningdrugs.blogspot.com/2007/05/there-are-too-many-silos-and-often.html">Aligning scientists and goals</a></li><ul><li>Putting scientists in the driver's seat</li><li>Goal congruence</li><li>Positive self-image</li><li>Fair evaluation system for high-performing scientist</li></ul><li>Industral cultures that <a href="http://miningdrugs.blogspot.com/2006/10/disruptive-technology-in-drug-design.html">foster innovation</a></li><ul><li>Take calculated risks</li><li>Openness and transparency</li><li>Adapt to constant changes</li><li>Facilitate effective collaboration</li></ul></ul><span style="font-weight: bold;">References</span><br /><ul><li>X. Zhong, G. B. Moseley, <span style="font-style: italic;">Mission possible: managing innovation in drug discovery</span>, Nature Biotechnology, <span style="font-weight: bold;">2007</span>, <span style="font-style: italic;">25</span>, 945-946. DOI <a href="http://www.blogger.com/dx.doi.org/10.1038/nbt0807-945">10.1038/nbt0807-945</a></li></ul>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.comtag:blogger.com,1999:blog-26255983.post-52756341696795674082007-11-02T23:07:00.000+01:002007-11-02T23:23:09.557+01:00Quotes from the neighbors (November 2007)==<a href="http://mndoci.com/blog/2007/10/30/the-value-of-data-extraction/">The value of feature extraction</a>==<br /><blockquote>I am a strong believer in the power of machine readable data, and the potential power it can provide. Whether it be a system like <a href="http://freebase.com/">Freebase</a> that allows you to add structure to data, or the kinds of entity extraction and data contexts that <a href="http://blog.jonudell.net/">Jon Udell</a> and <a href="http://jeffjonas.typepad.com/">Jeff Jonas</a> talk about, making your data more “intelligent” is something that we should strive for, whether it be in the world of business, or the world of life science (or as <a href="http://www.lexical.org.uk/">Andrew Walkingshaw</a> might point out, materials science). My old colleagues at <a href="http://scitegic.com/">Scitegic</a> have a motto - “Ask more of your data”. That has always resonated with me. So lets take yet another look at how to make your data smarter, especially when the semantic web seems to be inching towards more mainstream acceptance.<br /></blockquote><br />==<a href="http://depth-first.com/articles/2007/10/29/compiling-the-inchi-toolkit-to-pure-java-bytecode-with-nestedvm">Compiling the InChI Toolkit to Pure Java Bytecode with NestedVM</a>==<br /><blockquote>Many cheminformatics libraries are written in C and C++; being able to reliably and automatically port them to Java could potentially save a great deal of effort.<br /></blockquote><br />==<a href="http://pipeline.corante.com/archives/2007/10/24/come_on_improve_already.php">Come On. Improve, Already</a>==<br /><blockquote>And man, have there ever been layoffs in the industry this year. There’s a list of the larger ones over at FierceBiotech, and it does not make for cheerful reading. In January, Pfizer announces 10,000 job cuts and closes their Ann Arbor site. That same month, Bayer-Schering closes the doors on research buildings in Connecticut and California (layoffs which were announced in 2006 and are thus not on the Fierce list). Bayer-Schering, who really should have run that B-S initial thing past a couple of native English speakers, announces 6,100 more job cuts in March. In July, AstraZeneca doubles down on its earlier layoff announcement and says that 7,600 jobs will disappear, and J&J announces a 4% reduction in its workforce (5,000 jobs). Then in August, Amgen cut over 2,000 jobs of its own.<br /></blockquote>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.comtag:blogger.com,1999:blog-26255983.post-30449509781203535122007-11-02T19:14:00.000+01:002007-11-03T03:25:29.105+01:00Six Rules For Creating a Data Driven Drug Design Project<a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://upload.wikimedia.org/wikipedia/commons/a/ad/ThinkingMan_Rodin.jpg"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 152px; height: 203px;" src="http://upload.wikimedia.org/wikipedia/commons/a/ad/ThinkingMan_Rodin.jpg" alt="" border="0" /></a><blockquote>"<span style="font-style: italic;">The problem is that we live in the most perfect imperfect medium in the world: the Web.</span><br /><span style="font-style: italic;">For now it is impossible to collect data perfectly. It is ugly, it is dirty, it is incomplete and no matter how hard you try it is not going to get perfect.</span><br /><span style="font-style: italic;">Yet we can’t resist.</span>" [<a href="http://www.kaushik.net/avinash/2007/10/six-rules-for-creating-a-data-driven-boss.html">Occam's razor</a>]<br /></blockquote>Hahaha ... what a great statement of <a href="http://www.kaushik.net/avinash/2007/10/six-rules-for-creating-a-data-driven-boss.html">Avinash</a>. Lets translate his statement about "Six Rules For Creating A Data Driven Boss!" into a <a href="http://en.wikipedia.org/wiki/Drug_design">drug design</a> setting for getting "Six Rules For Creating a Data Driven Drug Design Project!".<br /><br /><span style="font-weight: bold;"># 1: Get Over Yourself</span><br /><blockquote>"<span style="font-style: italic;">In the past, many experimentalists shared a similar sentiment regarding of the usefulness of theoretical and computational methods to their drug discovery problems, namely: You're always explaining to me after the fact why a molecule i've already discovered is active, but you're never able to tell me ahead of time which molecules I should be investigating.</span>" [<a href="http://www.netsci.org/Science/Cheminform/feature06.html">VanDrie</a>]<br /></blockquote>If you have a better reasoning about drug design then random guessing then you should really get over yourself and <a href="http://en.wikipedia.org/wiki/Decision_making">give your opinion</a>. You are working in a team, so appreciate challenging questions and feel free to challenge others as well. Otherwise, everybody can still hope for <a href="http://en.wikipedia.org/wiki/Serendipity">serendipity</a>. But ...<br /><blockquote>"<span style="font-style: italic;">Hope is not a strategy</span>" [<a href="http://en.wikiquote.org/wiki/Hope">H. R. Clinton</a>]</blockquote><span style="font-weight: bold;"># 2: Embrace Incompleteness</span><br /><blockquote>"<span style="font-style: italic;">I have heard that if the 'drug universe' included all the possible drug--like molecules, its size would be 10^62 molecules (other authors have proposed 10^200). Even if half of the currently known chemicals were drugs (a very optimistic assumption), this would produce 10^7 molecules. Therefore, at most, only 10^-53% of the possible drug--like molecules are known.</span>" [<a href="http://dx.doi.org/10.1016/S1359-6446%2899%2901393-8">R. Lahana</a>]<br /></blockquote>Just accept that we are working in an area where the <a href="http://miningdrugs.blogspot.com/2007/01/sampling-biased-data.html">chemical space</a> is just too large for allowing us to get the complete picture of it. Thus, <a href="http://miningdrugs.blogspot.com/2006/10/bounded-rational-drug-design.html">embrace incompleteness (bounded rational drug design)</a>.<br /><br />Sometimes even <a href="http://en.wikipedia.org/wiki/Rational_ignorance">rational ignorance</a> might help a project, but you have to know how much effort it would be creating a <a href="http://en.wikipedia.org/wiki/Infinity">complete</a> data set.<br /><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://bp3.blogger.com/_cUdwVNvWCbc/RyuavkC0D-I/AAAAAAAAAEo/TjVWp58OIDM/s1600-h/infinity.gif"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer;" src="http://bp3.blogger.com/_cUdwVNvWCbc/RyuavkC0D-I/AAAAAAAAAEo/TjVWp58OIDM/s200/infinity.gif" alt="" id="BLOGGER_PHOTO_ID_5128362742611447778" border="0" /></a><br /><span style="font-weight: bold;"># 3: Give 10% Extra</span><br />If you are asked to deliver results, then try to give 10% more then requested and show that you are rather a <span style="font-weight: bold;">drug data ninja</span> than a <span style="font-weight: bold;">reporting squirrel</span>. Information rules ... prepare yourself for answering questions and for having supporting information available! You have not to be a dictionary, but having 10% extra information is not too difficult.<br /><blockquote>"<span style="font-style: italic;">Make a conscious choice what job you want to be in: </span><strong style="font-style: italic;">Reporting Squirrel</strong><span style="font-style: italic;"> or </span><strong style="font-style: italic;">Analysis Ninja</strong><span style="font-style: italic;"> ?</span>" [<a href="http://www.kaushik.net/avinash/2007/10/six-rules-for-creating-a-data-driven-boss.html">Occam's razor</a>]<br /></blockquote><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://upload.wikimedia.org/wikipedia/en/9/93/Common_Squirrel.jpg"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 122px; height: 169px;" src="http://upload.wikimedia.org/wikipedia/en/9/93/Common_Squirrel.jpg" alt="" border="0" /></a><br /><span style="font-weight: bold;"># 4: Become A Marketer</span><br /><blockquote>"<span style="font-style: italic;">If you want to change your boss and your company then you’ll have to become a Marketer, someone with an understanding marketing principles, someone who can be a customer advocate / champion, someone who can evangelize the purpose of data in creating customer centric decisions.</span>" [<a href="http://www.kaushik.net/avinash/2007/10/six-rules-for-creating-a-data-driven-boss.html">Occam's razor</a>]</blockquote>Everbody in drug design knows that 'real' data is the key for developing 'real' drugs. It costs many efforts and many people creating it. Drug design is not clicking buttons and counting bits and bytes. This is a big difference to <a href="http://en.wikipedia.org/wiki/Web_analytics">web analytics</a>. In drug design the data creation is an active process of the service provider, while in web analytics the data will be produced by any visitor to a web page (service provider).<br /><br />Creating a new data point needs a team of passionated persons, so appreciate any single data point.<br /><br />After the data is available, everybody in the team will be really excited to know if this data point will be the ultimative <a href="http://en.wikipedia.org/wiki/Blockbuster_drug">blockbuster</a>.<br /><br />Make clear, that data might help <a href="http://www.vfa.de/en/statistics/innovation/">reducing risks</a>. And even more important, do not sell things the data does not provide! Allow the data to influence a project, but avoid that overinterpreted data is misleading a project. Following marketing strategies might be one way of ranking data.<br /><br /><span style="font-weight: bold;"># 5: Business In The Service Of Data. Not.</span><br /><blockquote>"<span style="font-style: italic;">It is important, nay critical, to constantly check yourself and ask if the business is really serving the needs of data or vice a versa.</span>" [<a href="http://www.kaushik.net/avinash/2007/10/six-rules-for-creating-a-data-driven-boss.html">Occam's razor</a>]</blockquote> As said before is the data generation in drug design a time consuming and costly business. This means it is crucial to check if you can really learn from the data.<br /><br />Ask yourself: Are we learning from the data?<br /><br />If the answer is no, then something is wrong. Why are you producing data which costs time and money? Is the team not wasting their efforts?<br /><br />Do not spend all your time in analyzing the data. Spend time in talking to the team for producing interesting data in the future.<br /><br />Where do you want to go ? Go ahead! Avoid recursive data generation and conclusion cycles, which can keep you busy a while, but certainly not employed for the next years.<br /><br /><span style="font-weight: bold;"># 6: Adopt A New Mindset, Expand Horizons: Drug Design 2.0.</span><br />In drug design it does not count how many drugs you produce, it only counts how good they are.<br /><br />Business intelligence (BI) and business performance management (BPM) can optimize the process. So what? If you are in a deadloop, it does not matter if you are in a normal or an optimized deadloop for data generation and data analysis? Get out of it!<br /><br />Check what the competitors are doing! And if you are lucky, you can find this information in less than <a href="http://miningdrugs.blogspot.com/2007/08/six-degrees-of-drug-design-social.html">six steps</a> ;-)<br /><br /><span style="font-weight: bold;">Resources</span><br /><ul><li>Thinking Man <a href="http://en.wikipedia.org/wiki/Image:ThinkingMan_Rodin.jpg">@Wikipedia</a></li><li>J. H. VanDrie, <span style="font-style: italic;">3D Database Searching in Drug Discovery</span>, Network Science, <span style="font-weight: bold;">1995</span>. <a href="http://www.netsci.org/Science/Cheminform/feature06.html">URL</a></li><li>Infinity <a href="http://en.wikipedia.org/wiki/Image:Infinity_symbol.svg">@Wikipedia</a></li><li>Squirrel <a href="http://en.wikipedia.org/wiki/Image:Common_Squirrel.jpg">@Wikipedia</a><br /></li></ul>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.comtag:blogger.com,1999:blog-26255983.post-80064657421495201622007-10-25T01:39:00.000+02:002007-10-25T02:00:09.493+02:00Uniprot: Over a billion protein database entries<a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://beta.uniprot.org/"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 211px; height: 96px;" src="http://bp1.blogger.com/_cUdwVNvWCbc/Rx_YWEC0D8I/AAAAAAAAAEY/w2Q8Y_91yew/s320/uniprot_logo.gif" alt="" id="BLOGGER_PHOTO_ID_5125052774525177794" border="0" /></a>As posted by <a href="http://lists.w3.org/Archives/Public/public-semweb-lifesci/2007Oct/0113.html">Eric Jain</a> (via <a href="http://chem-bla-ics.blogspot.com/2007/10/one-billion-biochemical-rdf-triples.html">Egon</a>) are now more than a billion <a href="http://en.wikipedia.org/wiki/Resource_Description_Framework">RDF</a> available.<pre style="font-style: italic;" id="body"><blockquote>The latest release of the <a href="http://beta.uniprot.org/">UniProt</a> protein database contains just over a<br />billion triples*! PRESS RELEASE :-)<br />The data is all available via the (Semantic or otherwise) Web:<br /><a href="http://beta.uniprot.org/">http://beta.uniprot.org/</a><br />...or can be bulk-downloaded from:<br /><a href="ftp://ftp.uniprot.org/">ftp://ftp.uniprot.org/</a><br />* Counting some reification statements, and assuming no overlap between<br />"named graphs".<br />P.S. This should be the last you'll hear from me on this topic -- I'm off<br />to new adventures...</blockquote></pre>Furthermore is it <a href="http://www.expasy.uniprot.org/about/UniProtKB_RefSeq_crossref.txt">notable</a> that<br /><blockquote style="font-style: italic;">The UniProt Knowledgebase (UniProtKB) and NCBI Reference Sequence (RefSeq) in conjunction with Entrez Gene are widely used resources for protein information, including function, classification and cross-references. Prior efforts in ID mapping (iProClass ID mapping table: <a href="http://pir.uniprot.org/search/idmapping" class="linkifyplus">http://pir.uniprot.org/search/idmapping</a>) and recent collaborative efforts by UniProt and NCBI has resulted in the mapping of UniProtKB and RefSeq/Entrez Gene (GeneID) and incorporation of reciprocal cross-references in these databases. These cross-references will provide up-to-date links between the UniProt (<a href="http://www.uniprot.org/" class="linkifyplus">http://www.uniprot.org</a>) and NCBI databases (<a href="http://www.ncbi.nlm.nih.gov/RefSeq" class="linkifyplus">http://www.ncbi.nlm.nih.gov/RefSeq</a>). Improving these mappings and increasing their coverage is an ongoing process.</blockquote>Finally, is there a nice <a href="http://www.debugmode.com/wink/">wink</a> tutorial for <a href="http://en.wikipedia.org/wiki/Diabetes_mellitus">diabetes</a> around, which can be started <a href="http://beta.uniprot.org/demos/diabetes">here</a>.<br /><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://beta.uniprot.org/demos/diabetes"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer;" src="http://bp1.blogger.com/_cUdwVNvWCbc/Rx_agEC0D9I/AAAAAAAAAEg/bZKi7Zet1mg/s320/demo_diabetes_thumbnail.png" alt="" id="BLOGGER_PHOTO_ID_5125055145347125202" border="0" /></a>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.comtag:blogger.com,1999:blog-26255983.post-30162537985412296742007-10-24T01:50:00.000+02:002007-10-24T02:37:14.848+02:00SuperTarget, Matador, ChemSpider: When data curation becomes a hype<a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://insilico.charite.de/supertarget/"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 320px;" src="http://insilico.charite.de/supertarget/images/logo02.gif" alt="" border="0" /></a><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://www.chemspider.com/"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer;" src="http://bp0.blogger.com/_cUdwVNvWCbc/Rx6OA5tZemI/AAAAAAAAAEQ/_k4UlfX84Bo/s320/chemspider_bar.gif" alt="" id="BLOGGER_PHOTO_ID_5124689572136581730" border="0" /></a><blockquote style="font-style: italic;"> Although the first version of SuperTarget with all the search<sup> </sup>and discovery tools around drug-target relations is already<sup> </sup>an extensive resource for both large-scale research and in depth<sup> </sup>analysis, the captured knowledge is still far from complete<sup> </sup>and we would like to invite the community to help in increasing<sup> </sup>quality and quantity of the records. SuperTarget offers an option<sup> </sup>to upload and incorporate drug-target relations into a working<sup> </sup>queue. Uploaded entries will be reviewed and approved by an<sup> </sup>annotation team comprised of graduated scientists. Both <a href="http://insilico.charite.de/supertarget">SuperTarget</a><sup> </sup>and <a href="http://matador.embl.de/">Matador</a> can be used as knowledge sources, discovery tools<sup> </sup>or training sets for various applications in chemical biology<sup> </sup>and elsewhere. [DOI <a href="http://dx.doi.org/10.1093/nar/gkm862"><span style="">10.1093/nar/gkm862</span></a>]<br /></blockquote>Dear community, as already <a href="http://miningdrugs.blogspot.com/2007/01/why-wasting-time-if-others-can-help-you.html">highlighted earlier</a> is it a logical step that public data curation services like <a href="http://insilico.charite.de/supertarget">SuperTarget</a><sup>,</sup> <a href="http://matador.embl.de/">Matador</a>, and <a href="http://www.chemspider.com/">ChemSpider</a> become more popular. Lets see how far those services might go, but they should make things <a href="http://miningdrugs.blogspot.com/2007/09/six-sigma-in-drug-design-old-hands-or.html">more efficient</a> and help bringing <a href="http://miningdrugs.blogspot.com/2007/08/six-degrees-of-drug-design-social.html">light into the chaos</a>. And yes, we are still living in a <a href="http://miningdrugs.blogspot.com/2007/09/wikipedia-dbpedia-and-remaining.html">chaotic (data) world</a> and every help counts.<br /><br />For more access details for the databases contact the developers and authors. Here is some basic information.<br /><blockquote><i style="font-style: italic;">SuperTarget</i><span style="font-style: italic;"> and </span><i style="font-style: italic;">Matador</i><span style="font-style: italic;"> are available via their web sites, </span><a style="font-style: italic;" href="http://insilico.charite.de/supertarget">http://insilico.charite.de/supertarget</a><sup style="font-style: italic;"> </sup><span style="font-style: italic;">and </span><a style="font-style: italic;" href="http://matador.embl.de/">http://matador.embl.de</a><span style="font-style: italic;">. They can be obtained via a Creative</span><sup style="font-style: italic;"> </sup><span style="font-style: italic;">Commons Attribution-Noncommercial-Share Alike 3.0 License.</span> [DOI <a href="http://dx.doi.org/10.1093/nar/gkm862"><span style="">10.1093/nar/gkm862</span></a>]<br /></blockquote><blockquote><span style="font-style: italic;">You have limited rights in this regard. You can only assemble a database of 5000 structures or less, and their associated properties, from our database without our permission. You can download up to 1000 structures per day from the website. Please contact us at feedbackATchemspiderDOTcom to request an extension outside this constraint. We are willing to provide the ENTIRE database of ChemSpider structures at your request - the file will consist of InChI Strings, InChIKeys and ChemSpider IDs. These constraints are under regular review so please feel free to engage us in conversation. </span>[<a href="http://www.chemspider.com/FAQ.aspx">ChemSpider FAQ</a>]<br /></blockquote><br /><span style="font-weight: bold;">References</span><br /><ul><li>Günther S, Kuhn M, Dunkel M, Campillos M, Senger C, Petsalaki E, Ahmed J, Urdiales EG, Gewiess A, Jensen LJ, Schneider R, Skoblo R, Russell RB, Bourne PE, Bork P, Preissner R.<br /><span style="font-style: italic;">SuperTarget and Matador: resources for exploring drug-target relationships</span>,<br />Nucleic Acids Res., <span style="font-weight: bold;">2007</span>, ASAP alert.<br />DOI <a href="http://dx.doi.org/10.1093/nar/gkm862">10.1093/nar/gkm862</a><br />PMID <a href="http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&cmd=retrieve&list_uids=17942422">17942422</a><br /></li></ul>Joerg Kurt Wegnerhttp://www.blogger.com/profile/09112376168632883058noreply@blogger.comtag:blogger.com,1999:blog-26255983.post-78054186810228854322007-10-16T20:24:00.000+02:002007-10-17T00:46:55.022+02:00Open data, closed data, or the question about intellectual properties<blockquote><span style="font-style: italic;">Now, this poses quite a challenge: different licenses, different copyright holders, requirements to provide access to the source (for the Open data), etc, all in one system. Quite a challenge indeed, because ChemSpider is now required to track copyright and license information for each bit of information.</span> <span style="font-weight: bold;">[</span><a style="font-weight: bold;" href="http://chem-bla-ics.blogspot.com/2007/10/chemspider-suse-gnulinux-of-chemical.html">chem-bla-ics</a><span style="font-weight: bold;">]</span><br /></blockquote><i></i><blockquote><i>... these techniques have traditionally been considered the realms of scientists fro