[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Subject Index][Author Index]

Re: New paper on Neoaves



David Marjanovic wrote:

That was a late-night oversight on my part. If some of the taxa in a data matrix have a base-
composition bias, an evolution model estimated from the data can clearly lead to a wrong result.

Yes. But even worse, adding data can actually make the situation worse by making the erroneous topology well-supported, and deluding you into thinking that the 'fake' clade is 'real'. The tree looks 'better' (nicer bootstrap or posterior probabities) but the topology is 'worse', because homoplasy is no longer the 'noise' but the signal.


Adding taxa seems to be a good idea in general, however.

I general, yes. Nevertheless, I think we need a method to determine whether more taxa are helping the analysis, rather than just assuming that more taxa will help.


There's also the matter that improved taxon sampling. Some lineages make life difficult by leaving us only a few lingering species. Try resolving the relationships of hyracoids, elephants, and sea cows, for example. Between them they muster about a dozen extant species, and even less genera. (OK, you might get lucky and get your hands on some Steller's sea-cow or mammoth DNA; but even then the taxon sampling dries up at 13 species.) Then there's the tubulidentates, perissodactyls, etc., which are also stingy in this regard. I'm not blaming them, of course. They tried their hardest to survive into the Present. And it's not as though humans made it easy for them.

Since this was 1998 (you know, when some people honestly believed Rodentia was paraphyletic to the rest of Placentalia, and Passeriformes the sister-group to all other extant birds), I'll simply blame the model, and perhaps the fact that Bayesian analysis was not yet available.

No, the "wrong model" isn't the problem here; they tried many different models (four substitution, four among-site rate-heterogeneity), which are used today. Bayesian analysis didn't solve the problem either (I know one of the authors personally).


The reason is obvious: molecular characters all look the same and are thus very boring,

To a morphologist, yet. But people who study molecules beg to differ. We need to get these people on board so we can delve deeper into the sequence data and get a handle on what exactly is driving the signal, if not phylogeny.


Cheers

Tim