SciTegic Pipeline Pilot - Gene Expression Collection
Category Genomics>Gene Expression Analysis/Profiling/Tools
Abstract The Gene Expression Collection allows you to process, analyze, visualize, annotate, and report on gene expression experiments, including the individual target genes. Core functionality is based on BioConductor - - the open source and open development software project for the analysis and comprehension of genomic data.
By harnessing the graphical protocol building capabilities of SciTegic Pipeline Pilot, the Gene Expression Collection allows you to construct complex BioConductor-based workflows without writing code, while also making it easy for you to couple Gene Expression analysis with other SciTegic Pipeline Pilot-based processes, such as sequence analysis (see G6G Abstract Number 20069) and reporting. Products features/capabilities include:
With the Gene Expression Collection you can:
1) Analyze and annotate gene expression experiments.
2) Use BioConductor tools without programming in R, the well-known public domain package for statistical computing and graphics on which BioConductor is based.
3) Easily create protocols to compare different analysis methods.
4) Integrate R/BioConductor analyses with analyses conducted with other SciTegic Pipeline Pilot Collections, including the R Statistics and Text Analytics Collections (see G6G Abstract Number 20063).
5) Create comprehensive reports containing elements from both R graphics and the Reporting Collection.
Experiment Readers -- Affymetrix® and Agilent experiments can be read and processed using several BioConductor packages (affy, affyPLM, plier, limma). Processing includes background correction, normalization, and summarization.
Excel experiment readers can import data directly from existing spreadsheets. GEO and SOFT readers enable processing of data from NCBI’s Gene Expression Omnibus (see G6G Abstract Number 20013).
Experiment Analysis -- Fold change and pairwise differential expression are quickly calculated for full experiments using Student’s t-test or Wilcoxon tests with optional control for multiple comparisons. Gene subsets can be defined based on calculations or annotations -- all without copying or cloning data.
Subsets can also be defined using generic components like Outlier Filter or Top N Filter. Set operations (e.g., union, intersection, and subtraction) can be performed on existing subsets. Clustering components, including hierarchical and k-means, are also provided.
Data Manipulators -- Gene expression experiments can be quite large. Once a subset of interesting genes has been identified, the irrelevant portions of the experiment can be removed from the experiment record, greatly speeding up downstream processing. Additionally, individual genes can be extracted, leading to easy integration with the Sequence Analysis Collection.
Results Annotators -- Annotations [e.g., descriptions, pathway identifiers, and Gene Ontology (GO) terms] can be added using vendor provided data via BioConductor’s annaffy package. In addition, annotations can be imported from existing flat files.
Viewers and Reporting Tools -- Use BioConductor graphics or create your own graphics and reports using SciTegic Pipeline Pilot’s Reporting Collection. While BioConductor provides sophisticated plots and charts, they can be difficult to configure and use.
The Gene Expression Collection makes it easy to use these standard BioConductor reporting capabilities, while also giving you the option to implement additional data views using the Reporting Collection. This gives you more control over display features, including hyperlinks and tooltips. Supplied components include heat maps, annotated tables, and parallel coordinate plots.
Integration with Other SciTegic Pipeline Pilot Collections -- The components in the Gene Expression Collection are designed to work seamlessly with components from existing collections, including the Sequence Analysis, R Statistics, Data Modeling, and Chemistry (via pathways) Collections.
System Requirements
System Requirements for SciTegic Enterprise Server and Pipeline Pilot 7.0 (April 2008)
Operating System Requirements
Pipeline Pilot 7.0 is supported on Linux and Windows.
Supported Linux Operating Systems
- • Red Hat Enterprise Linux 4 WS or ES (any update), 32-bit x86. (This 32- bit Red Hat configuration is also supported on 64-bit hardware.)
- • Red Hat Enterprise Linux 5 WS or ES, 64-bit x86
- • SUSE Linux Enterprise Server and Desktop version 10, 64-bit x86 with a 32-bit install
- • Mozilla Firefox 2.x or another Mozilla-supported browser
- • Root privileges may be required for optional configurations
Supported Windows Server Operating Systems
- •Window Server 2003 with Service Pack 1
- • Internet Explorer 6.0 or 7.0, or Mozilla Firefox 2.x.
- • Optional: Microsoft Office 2007, 2003, XP or 2000, (for use with generic Excel components). ActiveState Python 2.1 or later (required on server for protocols that use Python scripting components).
- • System Administrator privileges are required for Windows server installations
Supported Windows Client Operating Systems
- • Windows XP (Service Pack 2) or Windows Vista
- • Internet Explorer 6.0 or 7.0, or Mozilla Firefox 2.x.
- • Windows Power User privileges (required on client machines as registry settings are updated during installation)
- • Optional: Microsoft Office 2007, 2003, XP or 2000, (for use with generic Excel components). Spotfire Decision Site 7.2 or later (for use with Spotfire Viewer components).
- Note: Separately purchased component collections may require additional third-party software. See the install guides for details.
Minimum Hardware Requirements
Server Hardware
- • Pentium or Xeon class computer, 2.0 GHz (dual core) or 3.0 GHz
- • As a guide, 1 CPU for every 2 simultaneous users should provide adequate performance
- • 4 GB RAM (3 - 6 GB may be required for 64-bit Linux), plan on at least 1 GB of RAM per CPU
- • Disk space (installation): 3 - 5 GB
- • Disk space (runtime environment, user data files, etc.): 30 GB (Disk space depends on the number of installed packages and the data types used with your packages. For example, disk space needed to support the Sequence Analysis package depends on the number of sequences and their length, as well as the type of data you need to use with these components.)
- Note: Swap space is also critical. The available swap file size should be at least 1 to 1.5 times the size of the physical RAM installed. For example, if you have a machine with 2 GB RAM installed, you should have at least 2 GB of swap available.K
Client Hardware
- • Pentium class computer (1 GHz or faster)
- • 512 MB RAM
- • 150 MB free disk space
Manufacturer
- Accelrys, Inc.
- 10188 Telesis Court. Suite 100
- San Diego, CA 92121-4779
- Tel: 1-858-799-5000
- Fax: 1-858- 799 5100
- support-scitegic@accelrys.com
Manufacturer Web Site Accelrys, Inc.
Price Contact manufacturer.
G6G Abstract Number 20172
G6G Manufacturer Number 100084