Protocol Online logo
Top : New Forum Archives (2009-): : DNA Methylation and Epigenetics

Finding Promoter Sequence for gene - Read many "how to" guides, but can't actually find it. (Mar/07/2011 )

Hi, I am looking to find the promoter region of a gene...except I am not sure how to get to it. I've read a tons of threads on "how to find promoter region of sequence", but still am confused on how/where to proceed.

Things I have repeatedly read, but have trouble understanding why they work that way:

1)Go to ensemble, look up gene, export sequence ~ first 1000 bp, blast this to make sure you have right region. I dont understand how getting the first 1000 bp will ALWAYS give you the promoter? Then they say put the sequence into primer design program (methylprimer express), and just "assume"? where the CpG islands pop up are..thats where your promoter is? (lets say you have 2 cpg islands..how do you know where the promoter is? is it one of the cpg islands, or does it span both...?)

2) + / - strand... i understand that one is sense (+) the other is (-) antisense strand..but why is it important to incorporate this when you are looking for promoter sequences?

Thanks for your help! (my gene is AXIN2) if anyone wants to show me with screenshots...that would be HIGHLY appreciated! :D

-sri2010-

Here is the -1 kb promoter sequence of human AXIN2 gene. If you can reliably determine the transcription start site, then you know where the promoter is. If you want to analyze DNA methylation of the promoter and there are multiple cpg island, you can just examine the one that is close to the transcription start site or core promoter. You don't need to worry the minus strand.

-1000 AAAAGTTAATTATTCGCTCATAACATAAGGAATCATTCTATAGCAGCAGCAGAATAAATA
CACAATGCTCTGGGTTTGTGACATATTTTCTCATTTTAACGGAGAAGAGAAGGTTGTGTG
TGCCTTATAAACGTTTTTGGGCAAATGTGGGAGAAATTCAAAGGTGACAATGGGGTGAGG
AGTTTTCGATACATTAAAACGGCATGCATGCCTTGTCGTTTGGAGTGGGAAAAAAGATAT
TCCTATCGGTTTAATATGTAACATATGGCAGAAGAGGGGACTTTGGAAGAGTAACGGGTA
TGCAGTTGTTTGGGAGAGATGGCCAATGTGTGCCGCTACGAGGGTCTTATGGCCAGCAGG
TTAGGAAGCTGGACCTAGGTCTTCAGGGCACTGAGGTATGCCTCCTAACTGGTGTTAGCA
CTCAGGCCTATACTGGCGGATTCACACCCCACTTTCACACATGCACAAACGCAATTCATC
CACCAACTGCAGTCCTATATTTACCCGAACTCAGATGCACGTGCACAAGCGCGCACAGGC
ATGCACACGCCTCCAGCGGCTGGGCCCGAGTCCGCCCTATGACAGCGGGCCAGCCAACCC
CAGGGCCGGAACTCCCTTTATAAAGTCCTCCAAGCCCAAATTCCCAGGGAAACTTGAATA
GAGGTAGAAAAGCGTTTGATGATAGAAAAAAAAAAATCTGCTTGCTTGCAGTTCTTGCGA
TGGCCATAATGCTACCAGAAACTGTTGCATTCCAGAGCCGGGGAGAGCGAGGGCAGGCAG
GGGGAGGAGGCGGGAGCAAAGAGGCCGGAGAACTTAAAAGAAGAAATCAGAACTCGGGCT
GAGAATGCCCTCGGGGCCGCGGGGGGCTGGCAGGGGGCGGAGGAGAGGCCGGGGAAGGGT
TAACGCGGGAGCGGAGCGTGCGGCGGGGGCGGGCCGGGGCGGGGCGGGGCGGGCGCGGGG
CGCTGTCCCTTTAAGGCGGGCGGCGCGGGCCGGCCGAGCG +1 CGGCTGTGATTGGCGCGGCG

-pcrman-