Code Search for Developers
 
 
  

cps.php from eXtensible Genome Data Broker at Krugle


Show cps.php syntax highlighted

<?php
if(empty($SITEDEF_H)){require('SITEDEF.php');}
if(empty($PARAM_H)){require('getPARAM.php');}
require('SSI_GDBprep.php');
virtual("${CGIPATH}SSI_GDBgui.pl/TWO_COLUMN_HEADER/" . $SSI_QUERYSTRING);
?>

<STYLE TYPE="text/css">
.breakout {color:#336666; font:bold 150% stencil,cursive;}
.ref {font:small-caps 100% monospace;}
.super {vertical-align:super; font:normal 0.5em monotype;}
.footnote {font:normal 0.75em monotype; margin-left:5px;}
#tfig { float:right;  margin:2px; border:2px solid #336666; background:#CCCC99;}

H1 {color:#336666; text-align:center; margin-bottom:2px;}
H3 {color:black; text-align:center; margin-top:2px;}

DIV#methods {text-align:justify; width:450px;}
  DIV#methods H3 {padding:1px 5px; border:1px solid maroon; margin-bottom:2px; background:#336666; color:#FFFFFF; font:bold 100% stencil,cursive;}
  DIV#methods P.synopsis {margin-top:2px; margin-left:0px;}

DIV.figure {clear:right; min-height:450px; margin:5px 0px; border:2px solid #336666; background:#CCCC99;}
  DIV.figure H3 {text-align:left; padding:2px 25px; margin:0px 0px 2px 0px; background:#999966; color:#FFFFFF; font:bold 100% stencil,cursive;}
  DIV.figure IMG { float:right; margin:2px; }
  DIV.figure P {}

DIV#discussion {width:100%;}
  DIV#discussion H2 {clear:both; padding:2px; margin-left:25px; margin-bottom:2px; background:#999966; color:#336666; font:bold 1em sans-serif}
  DIV#discussion H2 A.list_cases {position:absolute; right:8%; font-size:75%;}
  DIV#discussion IMG {margin-right:4px; vertical-align:bottom; padding:2px; border:1px solid grey; background:#FFFFFF;}
  DIV#discussion IMG.inline {float:right; clear:right; vertical-align:text-top; margin:2px 10px; border:1px solid #336666;}
  DIV#discussion P {margin:10px 5px 15px 25px; text-align:justify;}
  DIV#discussion P.sub {margin-top:1.5em; clear:right;}
  DIV#discussion H4 {margin:5px 0px 2px 25px;}

 </STYLE>

<DIV ID="mainWLS">

<H1>Locating the Cleavage / Polyadenylation Site (CPS)</H1>
<H3>( the 3' boundary of the mature mRNA )</H3>

<DIV ID='abstract'>
<P>Determining the exact nucleotide (or nucleotide region) at which transcription concludes for a gene
has been an active field of study for over three decades. However, with the increased availability of genomic sequence,
and thus an inherent need for gene annotation, tools and methods for accurate prediction of the downstream extent
of a gene are in demand now more than ever. Herein we discuss current knowledge of the biological properties
of the CPS as well as methods for locating an approximate region of termination given available EST and cDNA evidence.
</P>
</DIV>

<IMG ID='tfig' SRC='./figa.png'>
<DIV ID='methods'>
<H3>The Biology</H3>
<P CLASS='synopsis'>In eukaryotes, the common RNA-POL II transcribed gene undergoes the processes of maturation
(ie. capping, splicing, and polyadenylation) concurrent with transcription. While the exact mechanism which
terminates transcription is unproven, the common assumption is that a lack of processivity of the RNA-POL II 
caused by dissociation of stabilizing elongation factors in concert with cleavage/polyadenylation allow termination
to occur at the next thermodynamically favorable location (Zhao, 1999). Thus the exact point of pre-mRNA transcript termination
may or may not be conserved. However the site of cleavage/poyladenylation is generally static as shown by mutation
studies which produce transcriptional run-off when this site is altered. While the extent of transcriptional excess
may have regulatory consequences, identification of the cleavage/polyadenylation site is sufficient to determine the 
sequence extent of mature mRNA as related to the annotation of a given transcriptional unit.
</P>

<H3>Genomic Landmarks for Locating a CPS</H3>
<P CLASS='synopsis'>Cleavage / Polyadenylation sites are largely difficult to predict based
on sequence motifs due to their short and often degenerate nature. 3' Expressed Sequence Tags (ESTs) however, provide 
an empirical method for locating the CPS. By accurate spliced alignment of the 3' ESTs, cluster groups can be used to 
determine a localized region containing the CPS (Gautheret, 1998). For this analysis, 98,313 3' ESTs were clustered into 13,148 multi-member
clusters. Variation of the aligned 3' boundary within these Three-Prime EST Groups (TPEGs) is shown in figure 1. 
As previously observed (Graber, 1999), multiple CPS signals closely spaced (on the order of tens of nucleotides) 
result in variation of the 3' EST aligned ends (again on the order of tens of nucleotides). This analysis confirms
that over 95% of the current gene-model annotations containing TPEGs have 3' boundaries within 150 nucleotides of the TPEG defined
CPS (the vast majority have less than a 10 base difference). However exceptions do exists and have been flagged as 
possible alternative polyadenylation sites, false gene extensions, and false intronic gene mergers.
</P>
</DIV>

<DIV CLASS='figure'>
<H3>Figure 1: Three-Prime EST Group Variance</H3>
<IMG SRC='./fig1.png'>
<P CLASS='footnote'>This graph represents the difference in nucleotide position of each Three-Prime EST Group member 
(ie. each 3' EST) as compared to the median 3' boundary of the cluster. Values range from 0 to 93. The most prevalent 
positional difference is 0 representing alignment termination at a common nucleotide. 95% of the TPEG member ESTs have
variance of less than 16 nucleotides.
</P>
<span style='clear:right;' /><!-- hack to get IE to extend the DIV -->
</DIV>

<DIV CLASS='figure'>
<H3>Figure 2: TPEG CPS -vs- GeneModel Boundary</H3>
<IMG SRC='./fig2.png'>
<P CLASS='footnote'>This graph represents the difference in nucleotide position of each predicted TPEG defined CPS as compared 
to the annotated 3' boundary of the containing Gene Model. 95% of all TPEG defined CPSs lie within 150 nucleotides of the 
annotated 3' gene boundary. Variations much larger than this threshold are considered flagable events (alternative polyadenylation,
, false gene extensions, and false intronic gene mergers).
</P>
<span style='clear:right;' /> <!-- hack to get IE to extend the DIV -->
</DIV>


<DIV ID='discussion'>
<A HREF='./index.php'>Return to GAEVAL page</A>
</DIV>

</DIV>

<?php
require('SSI_GDBprep.php');
virtual("${CGIPATH}SSI_GDBgui.pl/STANDARD_FOOTER/" . $SSI_QUERYSTRING);
?>



See more files for this project here

eXtensible Genome Data Broker

The xGDB project provides scientists with an online portal for the integration of diverse sources of genomic data. Portals allow researchers to effectively target a specific scientific question by customizing their interactions with available data.

Project homepage: http://sourceforge.net/projects/xgdb
Programming language(s): JavaScript,Perl,PHP
License: other

  examples/
    altspl.png
    amb3.png
    amb5.png
    att.png
    cov0.png
    cov1.png
    cov2.png
    cov3.png
    cov4.png
    fission.png
    fussion.png
    ovext.png
    unext.png
  GLquery.css
  GLquery.js
  cps.php
  fig1.png
  fig2.png
  figa.png
  gaeval_lists.php
  index.php