Software CD-HIT-Grid | EGI Applications Database

Applications Database
Supporting
Applications Database
Supporting
Applications Database
Supporting
Applications Database
Supporting
Applications Database
Supporting

Click to visit the AppDB VMOps Dashboard for deploying and managing Virtual Machines to the EGI Cloud infrastructure.read more

.noscript.softwareentry {display: block;width: 1000px;}.noscript.softwareentry .field {display: block;padding-bottom: 0.5em;padding-left: 10em;width: auto;text-align:left;}.noscript.softwareentry .field .fieldtype {color: #444444;display: inline-block;font-weight: bold;min-width: 90px;vertical-align: top;width: 90px;}.noscript.softwareentry .field .fieldvalue {display: inline-block;max-width: 780px;text-align:left;}.noscript.softwareentry .field.image {height: 100px;left: -110px;position: absolute;top: 0;width: 100px;.height: 100px;border:none;}

Name:CD-HIT-Grid

Description:Protein clustering on the Grid with CD-HIT

Abstract:CD-HIT performs protein clustering on a protein or genome sequence database. This consists in removing redundant sequences at a given sequence similarity level and generating a new database with the representatives only. As protein and genome databases are growing up day after day, the clustering process on interesting datasets in a single machine is not feasible due to memory constrains. A Grid environment allows an adaptive database distribution in order to optimize its overall analysis. This activity was proposed by CNIO (Spanish National Cancer Research Centre) and started in the context of the BioGridNet Program.