Data Management Tools

Highlights

  • Comprehensive data management toolkit
  • Sophisticated signaling network tools
  • Comprehensive documentation

Phylosopher pursues an essentially data-agnostic policy and comes with a treasure chest of tools for automatically administering content.

Key Features

  • Integration of arbitrary genomes, both eukaryotic and prokaryotic, as well as sequence databases from popular public and commercial data providers
  • Scalable system that can simultaneously deal with thousands of genomes
  • Loading, genomic mapping and export of proprietary sequences such as clones and primers
  • Generation of genome-wide electronic Northerns
  • Orthology calculations for arbitrary genomes
  • Transcription factor binding site prediction for arbitrary genomes
  • Biological signaling network management, including:
    • Consolidation of pathway data from multiple data providers into a single super-pathway
    • Automatic partitioning of pathway data into annotated, human-manageable sub-networks
  • Automated annotation pipeline for sequence feature calculations
  • Programming framework for creation of robust custom data management software
  • Synchronization and performance optimization of the Oracle® database for very large data volumes
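
The signaling network features listed above (consolidation into a super-pathway and splitting into sub-networks) can be illustrated with a minimal sketch. This is not Phylosopher's implementation or API: the function names `consolidate` and `partition`, the provider names, and the edge-list input format are all assumptions made for illustration, and connected components stand in for whatever partitioning method the product actually uses.

```python
from collections import defaultdict


def consolidate(pathways):
    """Merge edge lists from several providers into one undirected graph.

    Hypothetical input: {provider_name: [(node_a, node_b), ...]}.
    Each merged edge remembers which providers reported it.
    """
    super_pathway = defaultdict(set)
    for provider, edges in pathways.items():
        for a, b in edges:
            # Normalize direction so (A, B) and (B, A) become one edge.
            key = tuple(sorted((a, b)))
            super_pathway[key].add(provider)
    return dict(super_pathway)


def partition(super_pathway):
    """Split the merged graph into connected components (sub-networks)."""
    adjacency = defaultdict(set)
    for a, b in super_pathway:
        adjacency[a].add(b)
        adjacency[b].add(a)

    seen = set()
    components = []
    for start in sorted(adjacency):
        if start in seen:
            continue
        # Depth-first traversal collecting every node reachable from start.
        stack = [start]
        component = set()
        while stack:
            node = stack.pop()
            if node in seen:
                continue
            seen.add(node)
            component.add(node)
            stack.extend(adjacency[node] - seen)
        components.append(sorted(component))
    return components


# Illustrative data only; provider names and interactions are placeholders.
providers = {
    "provider_a": [("EGFR", "GRB2"), ("GRB2", "SOS1")],
    "provider_b": [("GRB2", "EGFR"), ("TP53", "MDM2")],
}
merged = consolidate(providers)
subnetworks = partition(merged)
```

Here the EGFR/GRB2 interaction reported by both providers collapses into a single edge annotated with both sources, and the merged graph splits into two sub-networks.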

The management tools run on standard Linux/Unix server hardware and are designed for massively parallel execution. The system architecture supports multi-host environments and allows integration into cluster computing farms for data mining. Comprehensive documentation covers the key concepts of data integration and describes the management tools, including program manuals.