Comprehensive database of E. coli molecular components


About EcoCyc

Gene map for lacZ
Genome browser for lacZ
Comparison of ATP synthase subunits in different species

EcoCyc is a database of literature-based "gold standard" annotation of the molecular components of E. coli K-12, primarily referencing strain MG1655. EcoCyc is a central part of the BioCyc collection of databases at SRI International in Menlo Park, CA.


EcoCyc covers the following types of information:

  • Genes
    • Name and synonyms
    • Nucleotide position and sequence
    • Paralogs
  • Proteins and RNAs
    • Name and synonyms
    • Minireview summary
    • Enzyme activators, inhibitors, cofactors, reaction catalyzed, alternative substrates
    • Subunit structure, molecular weight, pI, amino acid sequence
  • Metabolites
  • Metabolic reactions, transport reactions
  • Metabolic pathways
  • Regulatory network
    • Regulation by transcription factors, attenuators, small RNAs
    • Promoters, transcription factor binding sites, terminators
    • Operons (transcription units)
    • Regulatory interactions

Using EcoCyc



  • Omics viewers: Paint omics datasets onto the full metabolic map, the full genome, or the transcriptional regulatory network (desktop mode only)
  • Interactive interrogation and tracing through metabolic network (desktop mode only)
  • Interactive interrogation of regulatory network (desktop mode only)
  • Genome map browser
  • Comparative genome map browser
  • Comparative operations for genomics and metabolism


The main EcoCyc searches are based on a three-tiered search mechanism:

  • Quick search: Enter a phrase into the quick-search box and press Enter. The phrase can be a complete or partial gene name, protein name, metabolite name, pathway name, or Gene Ontology term.
  • The second tier of searching allows more elaborate object-specific searches, under the Search menu. For example, Search->Gene/Proteins/RNAs allows you to formulate searches that filter based on gene or protein name, sequence length, molecular weight, cellular location, etc.
  • The third tier of searching involves complex searches against multiple object types and is typically used for global analysis of E. coli. See Search->Advanced.
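
The difference between the first two tiers can be pictured with a small in-memory sketch. Nothing here is EcoCyc's API or data model: the `Record` class, the `quick_search` and `filtered_search` functions, and the placeholder records are all hypothetical, chosen only to contrast a partial-name match with an attribute filter.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Record:
    """Hypothetical stand-in for a database object; not EcoCyc's data model."""
    name: str
    kind: str     # e.g. "gene", "protein", "pathway"
    length: int   # sequence length; the values below are made up


# Placeholder records for illustration only.
RECORDS = [
    Record("geneA", "gene", 900),
    Record("geneB", "gene", 1500),
    Record("GeneA kinase", "protein", 300),
    Record("pathway X", "pathway", 0),
]


def quick_search(phrase, records=RECORDS):
    """Tier 1: case-insensitive match on a complete or partial name."""
    needle = phrase.strip().lower()
    if not needle:
        return []
    return [r for r in records if needle in r.name.lower()]


def filtered_search(records=RECORDS, kind=None, min_length=None, max_length=None):
    """Tier 2: filter on object type and on an attribute range."""
    hits = []
    for r in records:
        if kind is not None and r.kind != kind:
            continue
        if min_length is not None and r.length < min_length:
            continue
        if max_length is not None and r.length > max_length:
            continue
        hits.append(r)
    return hits


if __name__ == "__main__":
    print([r.name for r in quick_search("genea")])
    print([r.name for r in filtered_search(kind="gene", min_length=1000)])
```

The quick search accepts any fragment of a name, while the filtered search narrows one object type by its properties, which is roughly the distinction the Search menu draws.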

Other searches under the Search menu include ontology-based searching and searches against the full text of E. coli articles available in EcoCyc.

EcoCyc gathers most of what is known about a gene or protein in one place. When you enter a query, the results let you choose between the gene and its protein product. Note that EcoCyc-authored minireviews and citations are found on the protein page, not the gene page. Example gene page: http://biocyc.org/ECOLI/NEW-IMAGE?type=GENE&object=EG10686
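
If you want to link to gene pages from a script or spreadsheet, the example URL above suggests a simple pattern: a fixed path plus `type` and `object` query parameters. The sketch below only reproduces that pattern. The function name `gene_page_url` is made up for illustration, and whether the site accepts other `type` values, or still serves this URL scheme, is an assumption you should check against the live site.

```python
from urllib.parse import urlencode

# Path taken from the example gene page URL above.
BASE_URL = "http://biocyc.org/ECOLI/NEW-IMAGE"


def gene_page_url(object_id, object_type="GENE"):
    """Build a page URL following the pattern of the example link.

    object_id is an EcoCyc object identifier such as "EG10686".
    Values of object_type other than "GENE" are an untested assumption.
    """
    query = urlencode({"type": object_type, "object": object_id})
    return f"{BASE_URL}?{query}"


if __name__ == "__main__":
    print(gene_page_url("EG10686"))
```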

To get you started, here are a few things people generally want.

To get a sequence to enter into your cloning software: If you need the gene sequence (start codon to stop codon) or the protein sequence, click the appropriate tab at the very top of the page. There is also a page that lets you specify the range of DNA you want. To get back to the parC page, click the Back button of your browser.
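
Once you have a stretch of sequence saved locally, pulling out a sub-range is easy to do yourself. The sketch below is a generic illustration, not how EcoCyc does it: it assumes 1-based, inclusive coordinates and returns the reverse complement for a minus-strand request. The function names and the toy sequence are made up.

```python
# Translation table for complementing unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def extract_range(genome, start, end, strand="+"):
    """Return genome[start..end] using 1-based, inclusive coordinates.

    For strand "-" the fragment is reverse-complemented so that it
    reads 5' to 3' on the minus strand.
    """
    if not (1 <= start <= end <= len(genome)):
        raise ValueError("coordinates must satisfy 1 <= start <= end <= length")
    if strand not in ("+", "-"):
        raise ValueError("strand must be '+' or '-'")
    fragment = genome[start - 1:end]
    return reverse_complement(fragment) if strand == "-" else fragment


if __name__ == "__main__":
    toy = "ATGCCGTA"  # toy sequence, not real E. coli data
    print(extract_range(toy, 2, 5))
    print(extract_range(toy, 2, 5, strand="-"))
```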

To see what genes flank your gene of interest: Click on the genome browser link at the map position heading (4th heading down the page). You can see which genes flank yours as well as their orientation. This tool resembles Google Maps™ in that it has arrows that let you move along the gene map in either direction. To get back to the previous page, click the Back button of your browser.
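
If you have exported gene coordinates and want the same answer without the browser, finding the neighbours is a matter of sorting by start position. The sketch below is an illustration under stated assumptions: the `Gene` tuple, the `flanking_genes` function, and the placeholder genes are hypothetical, and the code treats the map as linear, so it ignores the wrap-around of a circular chromosome.

```python
from typing import List, NamedTuple, Optional, Tuple


class Gene(NamedTuple):
    """Hypothetical record: name, start, end (bp), and strand ('+' or '-')."""
    name: str
    start: int
    end: int
    strand: str


def flanking_genes(genes: List[Gene], name: str) -> Tuple[Optional[Gene], Optional[Gene]]:
    """Return the genes immediately left and right of `name` by start position.

    Returns None on a side where there is no neighbour (linear map assumed).
    """
    ordered = sorted(genes, key=lambda g: g.start)
    names = [g.name for g in ordered]
    if name not in names:
        raise KeyError(name)
    i = names.index(name)
    left = ordered[i - 1] if i > 0 else None
    right = ordered[i + 1] if i + 1 < len(ordered) else None
    return left, right


if __name__ == "__main__":
    # Placeholder genes with made-up coordinates.
    genes = [
        Gene("geneB", 500, 900, "-"),
        Gene("geneA", 100, 400, "+"),
        Gene("geneC", 1000, 1600, "+"),
    ]
    left, right = flanking_genes(genes, "geneB")
    print(left.name, left.strand, "|", right.name, right.strand)
```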

To see where the gene lies relative to its counterparts in other organisms: Click the "Align in Multiple-Genome Browser" button halfway down the gene page. Next, select the organisms you want to compare (the alphabetical list runs across the page) and click "submit". To change or add organisms at any time, click the "select allowed organisms" tab at the top of the page. You can also change the gene you are viewing from the top of this page.

Usage examples

These papers cite EcoCyc papers listed in the reference section below. You could look at these for other examples of how this database is used.

Other sites with related content


EcoCyc runs in two modes. The desktop mode that you install locally on a Macintosh, PC, or Linux box is faster and has more extensive features than the Web version, and is what most heavy users employ. The web server mode provides information via the Internet. More information on these modes: EcoCyc Modes

The Pathway Tools software behind EcoCyc is written in Common Lisp. Pathway Tools has been applied to hundreds of other genomes, within BioCyc and by groups outside SRI.

Web Services/API

EcoCyc provides programming interfaces in Lisp, Perl, and Java.



Keseler I.M. et al. EcoCyc: A comprehensive view of Escherichia coli biology. Nucleic Acids Research 37:D464-70, 2009.

See Also

Riley M., Abe T., Arnaud M.B., Berlyn M.K.B., Blattner F.R., Chaudhuri R.R., Glasner J.D., Horiuchi T., Keseler I.M., Kosuge T., Mori H., Perna N.T., Plunkett III G., Rudd K.E., Serres M.H., Thomas G.H., Thomson N.R., Wishart D., and Wanner B.L. Escherichia coli K-12: a cooperatively developed annotation snapshot--2005. Nucleic Acids Research 34:1-9, 2006.

Salgado H., Santos-Zavaleta A., Gama-Castro S., Peralta-Gil M., Penaloza-Spinola M.I., Martinez-Antonio A., Karp P.D., and Collado-Vides J. The comprehensive updated regulatory network of Escherichia coli K-12. BMC Bioinformatics 7:5, 2006.

Keseler I.M., Collado-Vides J., Gama-Castro S., Ingraham J., Paley S., Paulsen I.T., Peralta-Gil M., and Karp P.D. EcoCyc: A comprehensive database resource for Escherichia coli. Nucleic Acids Research 33:D334-7, 2005.

Karp P.D., Arnaud M., Collado-Vides J., Ingraham J., Paulsen I.T., and Saier M.H. Jr. The E. coli EcoCyc Database: No Longer Just a Metabolic Pathway Database. ASM News 70(1):25-30, 2004.

External Links


