Revision as of 20:48, 15 March 2007 edit Daniellebourque (talk \| contribs) 36 edits No edit summary ← Previous edit		Revision as of 21:06, 15 March 2007 edit undo Daniellebourque (talk \| contribs) 36 edits No edit summary Next edit →
Line 1: The '''Biomolecular Object Network Databank''' ('''BOND''') is a ~~database~~bioinformatics ofdatabank ofcontaining information on small molecule ~~interactions~~and protein sequences, ~~which~~structures and interactions. The databank integrates a number of existing databses ~~and~~to ~~provides~~provide ~~additional~~a ~~functionality~~comprehensive overview of the information currently available for a given molecule. ==Background== {{Infobox Software\|name=BOND\|developer=Christopher Hogue et al., Samuel Lunenfeld Research Institute, Mount Sinai. Commercial rights: Unleashed Informatics\|latest_release_version=BIND 4.0, SMIDsuite\|genre=Bioinformatics tool\|license=Open Access\|website=[http://bond.unleashedinformatics.com/index.jsp?pg=0]}} The [[Blueprint Initiative]] started as a research program in the lab of [[Dr. Christopher Hogue]] at the [[Samuel Lunenfeld Research Institute]] at [[Mount Sinai Hospital]] in [[Toronto]]. On [[December 14]] [[2005]] [[Unleashed Informatics Limited]] acquired the commercial rights to The Blueprint Initiative [[intellectual property]]. This included rights to the protein interaction database BIND, the small molecule interaction database SMID, as well as the data warehouse SeqHound. Unleashed Informatics is a data management service provider and is overseeing the management and curation of The Blueprint Initiative under the guidance of Dr. Hogue.<ref>[http://www.blueprint.org Blueprint.org]</ref> ==Construction== BOND integrates the original Blueprint Initiative databases as well as other databases, such as [[Genbank]], combined with many tools required to analyze these data. Annotation links for sequences, including [[taxon identifiers]], [[redundant sequences]], [[~~Genome~~Gene Ontology ~~descriptions~~]] descriptions, [[Online Mendelian Inheritance in Man]] identifiers, [[conserved domains]], data base cross-references, [[LocusLink Identifiers]] and complete genomes are also available. Bond facilitates cross-database queries and is an [[open access]] resource which integrates interaction and sequence data~~[2]~~. <ref>[http://bond.unleashedinformatics.com BOND at Unleashed Informatics]</ref> ==Small Molecule Interaction Database (SMID)== The Small Molecule Interaction Database is a database containing protein ___domain-small molecule interactions. It uses a [[___domain-based approach]] to identify [[___domain families]], found in the [[Conserved Domain Database ([[CCD)]]), which interact with a [[query]] small molecule. The CCD from [[NCBI]] amalgamates data from several different sources; [[Protein FAMilies]] (PFAM)]], [[Simple Modular Architecture Research Tool]] (SMART)]], [[Cluster of Orthologous Genes]] (COGs)]], and NCBI’s own curated sequences. The data in SMID is derived from the Protein Data Bank (PDB), a database of known protein crystal structures. SMID can be queried by entering a [[protein GI]], [[___domain identifier]], [[PDB ID]] or [[SMID ID]]. The results of a search provide small molecule, protein, and ___domain information for each interaction identified in the database. Interactions with non-biological contacts are normally screened out by default. SMID-BLAST is a tool developed to annotate known small-molecule binding sites as well as to predict binding sites in proteins whose [[crystal structures]] have not yet been determined. The prediction is based on extrapolation of known interactions, found in the PDB, to interactions between an uncrystallized protein with a small molecule of interest. SMID-BLAST was validated against a test set of known small molecule interactions from the PDB. It was shown to be an accurate predictor of protein-small molecule interactions; 60% of predicted interactions identically matched the PDB annotated binding site, and of these 73% had greater than 80% of the binding residues of the protein correctly identified. Hogue, C et al. estimated that 45% of predictions that were not observed in the PDB data do in fact represent true positives[3]. Line 17: ==Biomolecular Interaction Network Database (BIND)== ===Introduction=== The idea of a database to document all known molecular interactions was originally put forth by [[Tony Pawson]] in the 1990’s and was later developed by scientists at the [[University of Toronto]] in collaboration with the [[University of British Columbia]]. The development of the Biomolecular Interaction Network Database (BIND) has been supported by grants from the Canadian Institutes of Health Research ([[CIHR]]), Genome Canada, the Canadian Foundation for Innovation and the Ontario Research and Development Fund. BIND was originally designed to be a constantly growing depository for information regarding biomolecular interactions, molecular complexes and pathways. As [[proteomics]] is a rapidly advancing field, there is a need to have information from scientific journals readily available to researchers. BIND facilitates the understanding of molecular interactions and pathways involved in cellular processes and will eventually give scientists a better understanding of developmental processes and disease pathogenesis The major goals of the BIND project are: to create a public proteomics resource that is available to all; to create a platform to enable [[datamining]] from other sources (PreBIND); to create a platform capable of presenting visualizations of complex molecular interactions. From the beginning, BIND has been [[open access]] and software can be freely distributed and modified. Currently, BIND includes a data specification, a database and associated data mining and visualization tools. Eventually, it is hoped that BIND will be a collection of all the interactions occurring in each of the major model organisms. Line 51: ==User statistics== The number of [[Unleashed Registrants]] has increased 10 fold since the integration of BIND. As of December 2006 registration fell just short of 10,000. Subscribers to the commercial versions of BOND fall into six general categories; [[agriculture]] and [[food]], [[biotechnology]], [[pharmaceuticals]], [[informatics]], [[materials]] and other. The biotechnology sector is the largest of these groups, holding 28% of subscriptions. Pharmaceuticals and informatics follow with 22% and 18% respectively. The [[United State]] holds the bulk of these subscriptions, 69%. Other countries with access to the commercial versions of BOND include [[Canada]], the [[United Kingdom]], [[Japan]], [[China]], [[Korea]], [[Germany]], [[France]], [[India]] and [[Australia]]. All of these countries fall below 6% in user share[2]. ==References==

Biomolecular Object Network Databank: Difference between revisions