Automated de novo sequencing of nucleic acids by liquid chromatography-tandem mass spectrometry

Automated de novo sequencing of nucleic acids by liquid chromatography-tandem mass spectrometry


Oberacher,H.; Mayr,B.M.; Huber,C.G.;

We present the first global computer-aided sequencing algorithm for the de novo determination of short nucleic acid sequences. The method compares the fragment ion spectra generated by collision-induced dissociation of multiply charged oligodeoxynucleotide-ions to the m/z values predicted employing established fragmentation pathways from a known reference sequence. The closeness of matching between the measured spectrum and the predicted set of fragment ions is characterized by the fitness, which takes into account the difference between measured and predicted m/z values, the intensity of the fragment ions, the number of fragments assigned, and the number of nucleotide positions not covered by fragment ions in the experimental spectrum. Smaller values for the fitness indicate a closer match between the measured spectrum and predicted m/z values. In order to find the sequence most closely matching the experimental spectrum, starting from a given nucleotide composition all possible oligonucleotide sequences are assembled followed by identification of the correct sequence by the lowest fitness value. Using this concept, sequences of 5- to 12-mer oligodeoxynucleotides were successfully de novo determined. High sequence coverage with fragment ions was essential for obtaining unequivocal sequencing results. Moreover, the collision energy was shown to have an impact on the interpretability of tandem mass spectra by the de novo sequencing algorithm. Experiments revealed that the optimal collision energy should be set to a value just sufficient for complete fragmentation of the precursor ion

J Am.Soc.Mass Spectrom. 2004 15(1):32-42
PubMed: 14698553