Liver-specific regulatory sequences


The following human regulatory sequences were used as a positive test set in

Krivan W, Wasserman WW.
A predictive model for regulatory sequences directing liver-specific transcription.
Genome Res. 2001 Sep;11(9):1559-66. Abstract

and were used to develop an alogrithm to identify known regulatory sequences in genes selectively expressed in liver.

The sequences have been mapped to their positions in the August version of the Genome Browser.

please see William Krivan's web page for further information on individual transcription factor binding sites within these sequences

return to homepage

> CYP7A_cholesterol_7alpha-hydroxylase chr8:66877157-66877358 AAGAGACTCAAGCTAGGCTTTTTATATACATAGTATCCAGATCCATTAAC TTGAGCTTGGTTGACAAAGCAAACAATTAGCCATTTGTTCATTCTATTAG AAAAAAAAAAGTGGTAGTAACTGGCCTTGAACTAAGTCCACAGGTATCAG AAGTGGTTCCAAAGCAATCAGAGACCTGCAATACTTGATAAGTTGAAGGT C

>PAH_phenylalanine_hydroxylase chr12:119360576-119360777 CACAAGATGAGAAGTTGTGTACTTGGCAAACTTAGAGCTGACCTTTGCTG ATTTGGAAGTTGAAGATTACCCAACCATTGCAGGTTTATCAGTTCTTTCT TGTTTATCTTCATGTGCAGAAGGTTGAGTTAATCATAATCCATGAGTTCA TGGCACAGAAACAAAACCTACATGACCCTTCTCTTGTTTTTTTATTCATT C

>The PAH_phenylalanine_hydroxylase regulatory region has a second aligning region of 100% identity on chromosome 3, as determined by a BLAT search. chr3:153419411-153419612
CACAAGATGAGAAGTTGTGTACTTGGCAAACTTAGAGCTGACCTTTGCTG ATTTGGAAGTTGAAGATTACCCAACCATTGCAGGTTTATCAGTTCTTTCT TGTTTATCTTCATGTGCAGAAGGTTGAGTTAATCATAATCCATGAGTTCA TGGCACAGAAACAAAACCTACATGACCCTTCTCTTGTTTTTTTATTCATT C

> glucose-6-phosphatase chr17:45127594-45127795 ACTGCCAAGAAGCATGCCAAAGTTAATCATTGGCCCTGCTGAGTACATGG CCGATCAGGCTGTTTTTGTGTGCCTGTTTTTCTATTTTACGTAAATCACC CTGAACATGTTTGCATCAACCTACTGGTGATGCACCTTTGATCAATACAT TTTAGACAAACGTGGTTTTTGAGTCCAAAGATCAGGGCTGGGTTGACCTG A

>protein_C_gene chr2:132012552-132012753
GACGGCATCCTTGGTGGGCAGAGGTGGGCTTCGGGCAGAACAAGCCGTGC TGAGCTAGGACCAGGAGTGCTAGTGCCACTGTTTGTCTATGGAGAGGGAG GCCTCAGTGCTGAGGGCCAAGCAAATATTTGTGGTTATGGATTAACTCGA ACTCCAGGCTGTCATGGCGGCAGGACGGCGAACTTGCAGTATCTCCACGA C

>IGF-I_insulin-like_growth_factor_I chr12:118920814-118921015 TCTCCCTCTTCTGGCAAAGTTATTGAGTAAGGACTTTTTTGGGCATGGTG ACAAATAACATCATACCTTTGCATTTTAAAACTAGAGCACAGAAGCATTT TTTTCCCTTAAAAGAATGTGTGTTAGTGACAGGGTTCGCAGACATTAAAA TACTTATGCTGCCATAGAAAATAAGGATCTGTTTTCTGATTAACTTTCTG C

> bilirubin_UDP-glucuronosyltransferase_UGT1*1 chr2:245338967-245339168 TGAGTATGAAATTCCAGCCAGTTCAACTGTTGTTGCCTATTAAGAAACCT AATAAAGCTCCACCTTCTTTATCTCTGAAAGTGAACTCCCTGCTACCTTT GTGGACTGACAGCTTTTTATAGTCACGTGACACAGTCAAACATTAACTTG GTGTATCGATTGGTTTTTGCCATATATATATATATAAGTAGGAGAGGGCG A

> aldolase_B chr9:113088480-113088681
TTCAAACTAATACTGTTTACAGGGAGTTAAACTTCTACAGTGGGATTAAA GGTCTGTACCACGTTAGCACAAATGTCACCTCTCTGTTAATCATAAAACA GGGTCACAGGCCAATGTTCACCACAAGGAGACAGGAGGACAACCTGGGAT GGGTAATGACAAAGAACGATTTCCGTACTCCTAAGCCTCTGCTCTCTCAG A

> insulin chrNA_random:12657879-12658080
TGGAAAGTGGCCCAGGTGAGGGCTTTGCTCTCCTGGAGACATTTGCCCCC AGCTGTGAGCAGGGACAGGTCTGGCCACCGGGCCCCTGGTTAAGACTCTA ATGACCCGCTGGTCCTGAGGAAGAGGTGCTGACGACCAAGGAGATCTTCC CACAGACCCAGCACCAGGGAAATGGTCCGGAAATTGCAGCCTCAGCCCCC A