Databases and other software tools for Gene Expression

In the table below are some of the tools, databases and other resources that we have reviewed over the last few months. If you know of others that merit inclusion in this group, we would be happy to oblige. Please send us a note and give us at least a top-level URL for the resource. If you think that this list misrepresents the capabilities of an entry (ie, if an entry has been updated recently) or if you can think of additional information that would be valuable to include, we'd be grateful for the input.

Abbreviations are expanded at the bottom of the page.

Databases with Application to Gene ExpressionDemo
SW Name
(Org)
Sequence
Analysis
Gene Expr
Analysis
Database
function
Interface
Free
Database
Access?
SW
Free?
Public
Data
Deposit?
Output
Type*
Online
Docs
Schema
Pubs
Other
URLs
GeneX
(NCGR)
L clust,perm
SOM,PCA,
stats,prof
plots,viz,
eisen
DB
(PostgreSQL,
Sybase)
HTML
XWin
Java
Yes Yes Yes
w/ registration
Data
formatted for
J-Express,
xcluster,
xgobi,
or in GeneXML and soon MAML
O No Yes IBM Systems J
40(2),'01
GeneX
Listservs
maxd
(Manchester)
No 2D plots, clust,
filt,via
MaxdView
DB
(PostgreSQL,
MySQL, Oracle8i
Java No? Yes No? ASCII A   EBI's Schema No? Data
Loading
Tool
Stanford
MicroArray DB

(Stanford)
? ? DB HTML No For Acad
nonprofits
No HTML
ascii
A Yes Yes Yes No
AMAD
(UCSF,
microarrays.org)
No Yes*
via
Cluster
TreeView
Perl
flatfile DB
HTML
javascript
No Yes No ASCII A Yes Flatfile No Screenshots
FAQ
MicroArray Project
(NHGRI)
BLAST ImAn,
clustering
DB Java
HTML
Demo1
Demo2
Yes* No Java
screens
G Yes Yes Yes  
NHGRI ArrayDB
v 2.2.03

(NHGRI)
Not Apparent
fr Descr
Described,
but not
available(?)
DB
Sybase
Oracle
HTML Demo Yes w/
Registration>
No ASCII,
image,
binary data
G Few Yes Yes Change Log
cDNA DB
(NHGRI)
DB of
CloneID,
Unigene
Sets
No DB FileMakerPro No Data
Free
No FMPro G Yes No No  
ExpressDB v2.00
(Harvard)
No No DB HTML
Javascript
Yes No No* HTML
Tables
A Yes Logical Model
Physical Model
Table Defs
Yes Yes
EPODB
(U Penn)
L No DB HTML
Java
Yes No No* HTML
Tables
A Yes Yes Yes Controlled
Vocabulary
RNA Abundance
DB (U Penn)
? ? DB HTML Yes ? No* HTML A Yes No No Controlled
Vocabulary
,
Slides
GenomXtools
(VisualGenomics)
Yes* clust,
plots,
map
Int win32 No No No win32
ascii
$ No No No No
ArrayExpress
(EBI)
? ? DB ? Yes ? Yes GEML
ascii
HTML
G No Yes No Meeting
Data mining
Gene Expression
Omnibus
(NCBI)
L No DB email
HTML
Tables
Yes No Yes GEO-XML G Yes Yes No DB
Tables
GeneChip Analysis
Suite
(Affymetrix)
L ImAn,
clust,
stats
A - GATC win32 No No No win32
GATC
$ No GATC (PDF) No Data Flow
GeneChip LIMS
(Affymetrix)
No No DB win32 No No No win32
GATC
$ No GATC (PDF) No Data Flow
GeneChip Data
Mining Tool

(Affymetrix)
L clust,
stats,
plots,
viz
A - GATC win32 No No No win32
ascii
$ No GATC (PDF) No Data Flow
Array Explorer
(Spotfire)
No clust,
stats,
PCA,
eisen,
plots,
viz
A - GATC win32 No No No win32
ascii
$ Yes NA No No
Genesight
(BioDiscovery)
No ImAn,
clust,
PCA,
stats,
plots,
viz
Int win32 No No No win32
ascii
$ No No Promo/Tech Report Related
products
CloneTracker
(BioDiscovery)
No Sybase-based
LIMS DB
DB Win32 No No No win32
ascii
$ No No No Related
products
GeneSpring
(Silicon Genetics)
L clust,
PCA,
stats,
plots,
viz,
map
A Java Yes No Yes* Java
ascii
$ Full No No
Resolver
(Rosetta)
L ImAn,
clust,
PCA,
stats,
plots,
viz,
map
Int Java No No No Java
ascii
$ No No No Response to
OMG GE RFI
LifeArray
(Incyte)
Yes clust,
PCA,
stats,
plots
A - GATC, others Java No No No Java
ascii
$ No No No No
SSBM
(Informax)
Yes L Int Java No No No Java
ascii
$ No No No No
Expressionist
(GeneData)
Yes* clust,
PCA,
stats,
MDS,
plots,
viz
A Java N N N Various $ No No No No
GXD
(Jackson Labs)
No No DB HTML Yes No Soon HTML
ascii
O Yes No Yes Mouse
Anatomical
Dictionary
GeneExpress
(Gene Logic)
L clust,
PCA,
stats,
plots,
viz
Int Java No No No varies $ No No No No
ChipDB
(Whitehead)
No ? DB HTML Yes No No HTML
ascii
A No No No GENECLUSTER


Demo -->
Analytical Tools with Application to Gene Expression
SW Name
(Org)
Primary
Function
Other
Features
Links to
Database
Interface/
Platform
Free
Access?
(if web)
SW
Free?
Data
Upload
Req'd?
Output
Formats
Online
Docs
Pubs?
Other
URLs
CyberT
(UC Irvine & NCGR)
t-test
variants
for GE
datasets
2/3D viz w/ xgobi GeneX
supports
HTML
Xwin
@UCI
@NCGR
Yes Yes HTML
ascii
Xwin
postscript
PDF
Yes Contact
Author
xgobi docs
(PDF)
J-Express
U Bergen
clust, eisen
SOM, prof
PCA, plots
stats
VRML GeneX
supports
Java NA Yes NA ascii
2,3D plots
postscript
Yes Yes, See
homepage
No
ArrayMaker
(Stanford)
ArCon   No win32 NA Yes No NA Yes No PDF,
req's jogger
also free
ScanAlyze
(Stanford)
uArray
Image
Analysis
Grid Def'n
spot analysis
Preps data
for DB
win32 NA free to acad
non-profits
No Tab-delim
ascii
Yes No No
Image/J
(NIH)
GP Sci
Image
Analysis
free plugins,
analyt routines
No Java NA Yes No Tab-delim
ascii
mult image
formats
XLS
No No Listserv,
Source Code,
ImAn links
Array-Pro
(Media Cybernetics)
Array
Image
Analysis
interfaces w/
GeneMaths
(see below)
NA win32 NA No No Tab-delim
ascii
mult image
formats
No No Demo,
Promo PDF,
SDK
GenExplore
(Applied Maths)
GE Data Mining
stats,clust,
SOM,PCA,
prof,eisen
interfaces w/
ArrayPro
(see above)
Oracle,
SQL Server,
ODBC
win32 NA No NA Tab-delim
ascii
WYSIWYG printing
No No Demo,
Promo PDF
Cluster
(Stanford)
clustering on
large datasets
hierarchical,
SOMs,kmeans,
PCA
No win32 NA free to acad
non-profits
No win32,
GIFs
No Yes TreeView
XCluster
(Stanford)
cmdline var
of cluster
multi
platform
GeneX
supports
command
line
via NCGR free to acad
non-profits
No ascii Yes Yes Academic
License
TreeView
(Stanford)
view cluster output make images
for pub
No win32 NA ree to acad
non-profits
No GIFs Yes No Source
Code
EPCLUST
(EBI)
hierarchical clust'g
of GE datasets
GIF viz
of output
No HTML Yes No Yes ascii,
GIF
No No No
Partek soph. GP data
analysis tool
clust,PCA,
MDS,viz
for GE data
No tcl/tk NA No No ascii,var images,
postscript
No No Screenshot
Demo
xgobi/xgvis
(ATT Labs,
Telcordia)
soph. GP data
analysis tool
exceptional
3D viz of
multiple variables
No XWindows NA Yes No XWindows
ascii
Yes Lots (References) Links to R,S,Splus
The R
language
comprehensive
statistical
analysis
good plotting
pkg; can use xgobi for
3D viz
PostgreSQL,
commandline
unix,Mac,PC
NA Yes No ascii,postscript,
GIF,XWindows
Yes No FAQ
Data
Explorer

(IBM)
Data flow
visual programming
hi quality
viz, analytical
modules
Possible;
not simple
XWindows
Java
NA Yes No images,
ascii, HDF
Yes No No
JMAviewer
(Albert
Einstein)
view,cluster
microarray
data
calls KEGG,
BLAST,
Unigene
Yes
JDBC
Java Demo No No ? No No Yes
ArrayViewer
(TIGR)
view, sort array data data
Normal-
ization,
gene links
Yes Java NA free to acad
non-profits
No ascii,
plots,
Spotfire,
No No Yes
arrayScout
(Lion BioSci)
view, plot, sort,
clust/eisen array data
stats, BioSCOUT
interface
SRS ? No No NA images
other?
No No No
SeqArray
(GCG)
clust, plot, query, eisen views of array data SeqWeb integration SeqStore Java No No Yes images
other?
No No No
GenePix4000
(Axon)
GenePix 4000
control
ImAn No win32 No No NA TIFFs
tab delim data
No No No
GeneSight
(BioDiscovery)
2/3D plots,clust,
PCA,time series
  links to Imagene win32 No No NA ? No No AutoGene
ArrayVision
(Imaging Research)
ImAn
Normalization
Batch Operation No win32 No No NA ? PDF
No Statistical Informatics
Array Stat
(Imaging Research)
Win/GUI
Stat eval
of GE data
Filtering &
processing of data
prior to clustering,
data mining
N WinNT/2K N N N Ascii Stats, plots, No Technical Report PPT Presentation
IPLab MicroArray Suite
(Scanalytics)
ImAn integrated
with IPLab
No Mac No No NA ? Details No No
Pathways 3
(Research Genetics)
ImAn, clust, plot, query, hyperb trees   Y Java 1.2 N N N ascii,images, postscript PPT Demo N N
SMA
(Speed Lab,
UCB)
R fn()s for
Signif tests,
Normalization
incl
data
N R N Y N R formats V Good Y Example A
Example B
Dapple Jeremy Buhler
(U Wash)
ImAn tunable spot
finding for
1&2 color scans
N X11/Qt N Y N Grid file useful
tech
report
  Source
code
TIGR uArray Tools
(TIGR)
Viewers & Spotfinder clust, ImAn, norm, plots N Java 1.2 (Viewers)
Win32 (Spotfinder
N A N ? ArrayViwer
SpotFinder

Other resources

Abbreviations:

  • Sequence, Gene Expr Analysis, Primary Function
    • ArCon = Array construction, planning, layout tool
    • clust = clustering via a number of algorithms
    • SOM = Self Organizing Maps
    • perm = Permutation Testing
    • prof = profile filtering
    • eisen = Eisen-type maps and dendrograms
    • GE = Gene Expression
    • GP = General Purpose
    • ImAn = Image Analysis
    • L = Limited
    • map = chromosome maps or localization
    • MDS = Multi-Dimensional Scaling
    • NA = Not Applicable
    • norm = Normalization of data
    • PCA = Principle Component Analysis
    • plots = 2D plotting capability
    • stats = statistical tools
    • viz = 3D (or higher) visualization
    • Yes* = Available via another module in the suite.
    • ? = Unknown
  • Database Function
    • DB = is substantially a Database
    • Int = comes with an integrated Database
    • A = can access other databases
    • GATC = a particular type of Database schema designed by Affymetrix and Molecular Dynamics for storing gene expression data.
  • Interface
    • HTML = Hypertext Markup language, the lingua franca of the Web
    • Java = an interpreted language that supposedly allows "write-once, run anywhere",
    • Javascript = unrelated to Java, typically used to add lightweight, browser-based intelligence to interfaces
    • win32 = the native Microsoft Windows interface
    • tcl/tk = tcl is another interpreted language that allows it to run on many platforms; tk is the GUI interface that allows point+click utility
    • XWin = the X11 Windowing system, typically runs on Linux, Unix workstations altho there is Xserver software available for MS Windows and Macs. Allows one screen to show output from multiple networked machines simultaneously.
    • CLI = Command Line Interface, the oldest, but still powerful way of interacting with computers, as in DOS, terminal windows and shell commands.
  • SW Free?
    • Yes* - Most of the software is free, but it uses commercial software in its implementation which is NOT free.
    • A - Free to Academic & non-profits, Costs commercial entities.
  • Public Data Deposit
    • yes* = The database supports public data upload, but only via vendor-specific tools
    • no* = Much of the data is public but the public cannot directly submit their data to the database.
  • Type
    • A = Academic research project
    • G = Government organization.
    • $ = Commercial project; for gene expression especially, these products tend to be extremely expensive.
    • O = Non-profit Organization