Publication List

2010

Mularoni, Loris, Ledda, Alice, Toll-Riera, Macarena, Albà, M Mar

Natural selection drives the accumulation of amino acid tandem repeats in human proteins. (Article)

Genome research, 20 (6), pp. 745–54, 2010, ISSN: 1549-5469.

(Abstract | Links | BibTeX | Tags: Amino Acid, Amino Acid Sequence, Amino Acids, Amino Acids: chemistry, Amino Acids: genetics, Animals, Genetic, Humans, Molecular Sequence Data, Proteins, Proteins: chemistry, Proteins: genetics, Repetitive Sequences, Selection, Sequence Homology)

@article{Mularoni2010,
title = {Natural selection drives the accumulation of amino acid tandem repeats in human proteins.},
author = {Mularoni, Loris and Ledda, Alice and Toll-Riera, Macarena and Albà, M Mar},
url = {http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=2877571&tool=pmcentrez&rendertype=abstract},
issn = {1549-5469},
year = {2010},
date = {2010-01-01},
journal = {Genome research},
volume = {20},
number = {6},
pages = {745--54},
abstract = {Amino acid tandem repeats are found in a large number of eukaryotic proteins. They are often encoded by trinucleotide repeats and exhibit high intra- and interspecies size variability due to the high mutation rate associated with replication slippage. The extent to which natural selection is important in shaping amino acid repeat evolution is a matter of debate. On one hand, their high frequency may simply reflect their high probability of expansion by slippage, and they could essentially evolve in a neutral manner. On the other hand, there is experimental evidence that changes in repeat size can influence protein-protein interactions, transcriptional activity, or protein subcellular localization, indicating that repeats could be functionally relevant and thus shaped by selection. To gauge the relative contribution of neutral and selective forces in amino acid repeat evolution, we have performed a comparative analysis of amino acid repeat conservation in a large set of orthologous proteins from 12 vertebrate species. As a neutral model of repeat evolution we have used sequences with the same DNA triplet composition as the coding sequences--and thus expected to be subject to the same mutational forces--but located in syntenic noncoding genomic regions. The results strongly indicate that selection has played a more important role than previously suspected in amino acid tandem repeat evolution, by increasing the repeat retention rate and by modulating repeat size. The data obtained in this study have allowed us to identify a set of 92 repeats that are postulated to play important functional roles due to their strong selective signature, including five cases with direct experimental evidence.},
keywords = {Amino Acid, Amino Acid Sequence, Amino Acids, Amino Acids: chemistry, Amino Acids: genetics, Animals, Genetic, Humans, Molecular Sequence Data, Proteins, Proteins: chemistry, Proteins: genetics, Repetitive Sequences, Selection, Sequence Homology}
}

2007

Mularoni, Loris, Veitia, Reiner A, Albà, M Mar

Highly constrained proteins contain an unexpectedly large number of amino acid tandem repeats. (Article)

Genomics, 89 (3), pp. 316–25, 2007, ISSN: 0888-7543.

(Abstract | Links | BibTeX | Tags: Amino Acid, Amino Acid Sequence, Animals, Complementary, Conserved Sequence, DNA, Evolution, Genetic, Humans, Mice, Molecular, Point Mutation, Proteins, Proteins: chemistry, Proteins: genetics, Repetitive Sequences, Selection, Trinucleotide Repeats)

@article{Mularoni2007,
title = {Highly constrained proteins contain an unexpectedly large number of amino acid tandem repeats.},
author = {Mularoni, Loris and Veitia, Reiner A and Albà, M Mar},
url = {http://www.ncbi.nlm.nih.gov/pubmed/17196365},
issn = {0888-7543},
year = {2007},
date = {2007-01-01},
journal = {Genomics},
volume = {89},
number = {3},
pages = {316--25},
abstract = {Single-amino-acid tandem repeats are very common in mammalian proteins but their function and evolution are still poorly understood. Here we investigate how the variability and prevalence of amino acid repeats are related to the evolutionary constraints operating on the proteins. We find a significant positive correlation between repeat size difference and protein nonsynonymous substitution rate in human and mouse orthologous genes. This association is observed for all the common amino acid repeat types and indicates that rapid diversification of repeat structures, involving both trinucleotide slippage and nucleotide substitutions, preferentially occurs in proteins subject to low selective constraints. However, strikingly, we also observe a significant negative correlation between the number of repeats in a protein and the gene nonsynonymous substitution rate, particularly for glutamine, glycine, and alanine repeats. This implies that proteins subject to strong selective constraints tend to contain an unexpectedly high number of repeats, which tend to be well conserved between the two species. This is consistent with a role for selection in the maintenance of a significant number of repeats. Analysis of the codon structure of the sequences encoding the repeats shows that codon purity is associated with high repeat size interspecific variability. Interestingly, polyalanine and polyglutamine repeats associated with disease show very distinctive features regarding the degree of repeat conservation and the protein sequence selective constraints.},
keywords = {Amino Acid, Amino Acid Sequence, Animals, Complementary, Conserved Sequence, DNA, Evolution, Genetic, Humans, Mice, Molecular, Point Mutation, Proteins, Proteins: chemistry, Proteins: genetics, Repetitive Sequences, Selection, Trinucleotide Repeats}
}

Albà, M M, Tompa, P, Veitia, R A

Amino acid repeats and the structure and evolution of proteins. (Article)

Genome dynamics, 3 pp. 119–30, 2007, ISSN: 1660-9263.

(Abstract | Links | BibTeX | Tags: Amino Acid, Animals, Base Composition, Evolution, Humans, Molecular, Open Reading Frames, Open Reading Frames: genetics, Peptides, Peptides: chemistry, Proteins, Proteins: chemistry, Proteins: genetics, Repetitive Sequences)

2006

Mularoni, Loris, Guigó, Roderic, Albà, M Mar

Mutation patterns of amino acid tandem repeats in the human proteome. (Article)

Genome biology, 7 (4), pp. R33, 2006, ISSN: 1465-6914.

(Abstract | Links | BibTeX | Tags: Amino Acid, Amino Acid Substitution, Amino Acid: genetics, Codon, Expressed Sequence Tags, Genetic, Humans, Mutation, Polymorphism, Protein, Proteome, Proteome: genetics, Repetitive Sequences, Sequence Analysis)

2004

Huang, Hui, Winter, Eitan E, Wang, Huajun, Weinstock, Keith G, Xing, Heming, Goodstadt, Leo, Stenson, Peter D, Cooper, David N, Smith, Douglas, Albà, M Mar, Ponting, Chris P, Fechtel, Kim

Evolutionary conservation and selection of human disease gene orthologs in the rat and mouse genomes. (Article)

Genome biology, 5 (7), pp. R47, 2004, ISSN: 1465-6914.

(Abstract | Links | BibTeX | Tags: Amino Acid, Amino Acid: genetics, Animal, Animals, Chromosome Mapping, Chromosome Mapping: methods, Conserved Sequence, Conserved Sequence: genetics, Disease Models, Evolution, Fishes, Fishes: genetics, Fungal, Fungal: genetics, Genes, Genes: genetics, Genes: physiology, Genetic, Genetic Diseases, Genome, Helminth, Helminth: genetics, human, Humans, Inborn, Inborn: genetics, Inborn: physiopathology, Insect, Insect: genetics, Mice, Molecular, Mutagenesis, Mutagenesis: genetics, Nucleic Acid, Nucleotides, Nucleotides: genetics, Point Mutation, Point Mutation: genetics, Rats, Repetitive Sequences, Selection, Sequence Homology, Trinucleotide Repeat Expansion, Trinucleotide Repeat Expansion: genetics)

Albà, M Mar, Guigó, Roderic

Comparative analysis of amino acid repeats in rodents and humans. (Article)

Genome research, 14 (4), pp. 549–54, 2004, ISSN: 1088-9051.

(Abstract | Links | BibTeX | Tags: Amino Acid, Amino Acid: genetics, Amino Acid: physiology, Animals, Chromosome Mapping, Chromosome Mapping: methods, Chromosome Mapping: statistics & numerical data, Computational Biology, Computational Biology: methods, Computational Biology: statistics & numerical data, GC Rich Sequence, GC Rich Sequence: genetics, Humans, Mice, Proteins, Proteins: chemistry, Proteins: genetics, Proteins: physiology, Rats, Repetitive Sequences, Trinucleotide Repeats, Trinucleotide Repeats: genetics)

@article{Alba2004,
title = {Comparative analysis of amino acid repeats in rodents and humans.},
author = {Albà, M Mar and Guigó, Roderic},
url = {http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=383298&tool=pmcentrez&rendertype=abstract},
issn = {1088-9051},
year = {2004},
date = {2004-01-01},
journal = {Genome research},
volume = {14},
number = {4},
pages = {549--54},
abstract = {Amino acid tandem repeats, also called homopolymeric tracts, are extremely abundant in eukaryotic proteins. To gain insight into the genome-wide evolution of these regions in mammals, we analyzed the repeat content in a large data set of rat-mouse-human orthologs. Our results show that human proteins contain more amino acid repeats than rodent proteins and that trinucleotide repeats are also more abundant in human coding sequences. Using the human species as an outgroup, we were able to address differences in repeat loss and repeat gain in the rat and mouse lineages. In this data set, mouse proteins contain substantially more repeats than rat proteins, which can be at least partly attributed to a higher repeat loss in the rat lineage. The data are consistent with a role for trinucleotide slippage in the generation of novel amino acid repeats. We confirm the previously observed functional bias of proteins with repeats, with overrepresentation of transcription factors and DNA-binding proteins. We show that genes encoding amino acid repeats tend to have an unusually high GC content, and that differences in coding GC content among orthologs are directly related to the presence/absence of repeats. We propose that the different GC content isochore structure in rodents and humans may result in an increased amino acid repeat prevalence in the human lineage.},
keywords = {Amino Acid, Amino Acid: genetics, Amino Acid: physiology, Animals, Chromosome Mapping, Chromosome Mapping: methods, Chromosome Mapping: statistics & numerical data, Computational Biology, Computational Biology: methods, Computational Biology: statistics & numerical data, GC Rich Sequence, GC Rich Sequence: genetics, Humans, Mice, Proteins, Proteins: chemistry, Proteins: genetics, Proteins: physiology, Rats, Repetitive Sequences, Trinucleotide Repeats, Trinucleotide Repeats: genetics}
}

2002

Albà, M Mar, Laskowski, Roman A, Hancock, John M

Detecting cryptically simple protein sequences using the SIMPLE algorithm. (Article)

Bioinformatics (Oxford, England), 18 (5), pp. 672–8, 2002, ISSN: 1367-4803.

(Abstract | Links | BibTeX | Tags: Algorithms, Amino Acid, Amino Acid Sequence, Amino Acid: genetics, Databases, Genetic, Genetic Variation, Internet, Minisatellite Repeats, Minisatellite Repeats: genetics, Models, Molecular Sequence Data, Protein, Protein: methods, Proteins, Proteins: chemistry, Repetitive Sequences, Saccharomyces cerevisiae, Saccharomyces cerevisiae: genetics, Sensitivity and Specificity, Sequence Analysis, Sequence Homology, Software, Statistical)

2010

2007

2006

2004

2002

Recent Posts

Share