Abstract
Udgivelsesdato: February 2005
Originalsprog | Engelsk |
---|---|
Tidsskrift | Cladistics |
Vol/bind | 21 |
Udgave nummer | 1 |
Sider (fra-til) | 15-20 |
ISSN | 0748-3007 |
Status | Udgivet - 2005 |
Adgang til dokumentet
Citationsformater
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS
Sequence length variation, indel costs, and congruence in sensitivity analysis. / Aagesen, Lone; Petersen, Gitte; Seberg, Ole.
I: Cladistics, Bind 21, Nr. 1, 2005, s. 15-20.Publikation: Bidrag til tidsskrift › Tidsskriftartikel › Forskning › peer review
}
TY - JOUR
T1 - Sequence length variation, indel costs, and congruence in sensitivity analysis
AU - Aagesen, Lone
AU - Petersen, Gitte
AU - Seberg, Ole
PY - 2005
Y1 - 2005
N2 - The behavior of two topological and four character-based congruence measures was explored using different indel treatments in three empirical data sets, each with different alignment difficulties. The analyses were done using direct optimization within a sensitivity analysis framework in which the cost of indels was varied. Indels were treated either as a fifth character state, or strings of contiguous gaps were considered single events by using linear affine gap cost. Congruence consistently improved when indels were treated as single events, but no congruence measure appeared as the obviously preferable one. However, when combining enough data, all congruence measures clearly tended to select the same alignment cost set as the optimal one. Disagreement among congruence measures was mostly caused by a dominant fragment or a data partition that included all or most of the length variation in the data set. Dominance was easily detected, as the character-based congruence measures approached their optimal value when indel costs were incremented. Dominance of a fragment or data partition was overwhelmed when new sequence length-variable fragments or data partitions were added.
AB - The behavior of two topological and four character-based congruence measures was explored using different indel treatments in three empirical data sets, each with different alignment difficulties. The analyses were done using direct optimization within a sensitivity analysis framework in which the cost of indels was varied. Indels were treated either as a fifth character state, or strings of contiguous gaps were considered single events by using linear affine gap cost. Congruence consistently improved when indels were treated as single events, but no congruence measure appeared as the obviously preferable one. However, when combining enough data, all congruence measures clearly tended to select the same alignment cost set as the optimal one. Disagreement among congruence measures was mostly caused by a dominant fragment or a data partition that included all or most of the length variation in the data set. Dominance was easily detected, as the character-based congruence measures approached their optimal value when indel costs were incremented. Dominance of a fragment or data partition was overwhelmed when new sequence length-variable fragments or data partitions were added.
M3 - Journal article
VL - 21
SP - 15
EP - 20
JO - Cladistics
JF - Cladistics
SN - 0748-3007
IS - 1
ER -