Enteric PIPs   The Enteric server


Reference genome:

Address or gene name in the reference genome:

Address ranges: ECO: 1-4639221, STM: 1-4951371, ECH: 1-5528445, STY: 1-5133713, SPA: 1-4585228, KPN: 1-5694892 and ESA: 1-4530777. The STM, STY, KPN and ESA sequences include the plasmids.

The complete gene lists for the reference genomes are here (ECO, STM, STY, ECH, SPA, KPN, ESA).
 

Genomes to be included in the display (up to 11). A list of abbreviations is here. By default, the first 11 genomes are selected.

E. coli K-12 views:
          ECH ECU ECA ECP SHG SHD SHS STM STY SPA SPC
          KPN ERC ERA SMA YPE YEN VCH PAE ESA CKO (user)

S. typhimurium LT2 views:
          SSL STA STF STY STT SPA SPB SPR SPC SEP SEN SEB
          SDU SGA SNE SNL SSA SDI SAR SHO SBO ECO ECH ECU
          ECA ECP SHG SHD SHS KPN ERC ERA SMA YPE YEN VCH (user)

E. coli O157:H7 views:
          ECO ECU ECA ECP SHG SHD SHS STM STY SPA SPC
          KPN ERC ERA SMA YPE YEN VCH PAE ESA CKO (user)

S. typhi CT18 views:
          STT STM SSL STA STF SPA SPB SPR SPC SEP SEN SEB
          SDU SGA SNE SNL SSA SDI SAR SHO SBO ECO ECH ECU
          ECA ECP SHG SHD SHS KPN ERC ERA SMA YPE YEN VCH (user)

S. paratyphi A views:
          STY STT SPB SPR SPC STM SSL STA STF SEP SEN SEB
          SDU SGA SNE SNL SSA SDI SAR SHO SBO ECO ECH ECU
          ECA ECP SHG SHD SHS KPN ERC ERA SMA YPE YEN VCH (user)

K. pneumoniae views:
          KOX STY STT STM SSL STA STF SPA SPB SPR SPC SEP
          SEN SEB SDU SGA SNE SNL SSA SDI SAR SHO SBO ECO
          ECH ECU ECA ECP SHG SHD SHS ERC ERA SMA YPE YEN
          VCH (user)

E. sakazakii views:
          CKO ECO ECH ECU ECA ECP SHG SHD SHS STM STY
          SPA KPN ERC ERA SMA YPE YEN VCH PAE (user)
 

User sequence (optional). No FASTA headers, please!




Please NOTE that some queries may require more than 30s to complete.


Attention: if you use Adobe Acrobat to view these PDF files, be sure to read these instructions!
 


Description


The Enteric server produces a graphical, hypertext view of the pairwise alignments of the selected reference genome (i.e., E. coli K-12, E. coli O157:H7, S. typhimurium LT2, S. typhi CT18, S. paratyphi A, K. pneumoniae or Enterobacter sakazakii) with several related bacteria covering 20Kb around a user-specified position or gene in the reference organism. Other organisms can be integrated as sequence data become available. For each of the compared bacteria, all significant pairwise alignments with the reference sequence in the selected region were computed with a locally developed DNA sequence alignment tool called blastz. The alignment information is displayed as a set of ``PIPs'' (Percent Identity Plots), in which positions in the reference sequence are shown along the horizontal axis and each match between the reference sequence and the other bacterial sequence is represented as a horizontal line. The percent identity of each match, a value between 50% and 100%, is then indicated by its vertical coordinate.

Additional information is embedded in each PIP. For instance, in the E. coli K-12 and O157:H7 comparative views, clicking on E. coli gene names (shown at the top of the PIP) links to their Entrez Gene classifications. Pointing at an alignment reveals the name of the contig matching that E. coli region. Colored vertical stripes at the ends of some alignments indicate the reason why that alignment terminated in the other species. Red marks sequences in the other species whose immediate neighbor has a homolog elsewhere in E. coli, and pointing at the stripe displays that other E. coli address. Blue marks sequences in the other species whose immediate neighbor has no detectable homolog in E. coli, and pointing at the stripe displays the length of the unmatched sequence. The other colored regions have no associated messages; for instance, yellow is used for regions of the reference sequence not found in the other species. Regions apparently not sequenced in the other species are marked with gray. NOTE: the significance of the color bar features for alignments in the user sequence is somewhat different. Since the sequence provided was aligned only with the 20Kb query region, and not with the entire reference genome, for performance reasons, the annotations should be interpreted with respect to that range.



Page last revised Jun 29th, 2008.