Quality of a dataset #128

@ms609

Haag et al. measure the ruggedness of a tree landscape with a regression model (trained on molecular datasets, implemented in C) that uses the following features:

  • Unique topologies after 100 parsimony searches: 42.9 %
  • RF-Distance between parsimony trees: 33.2 %
  • Entropy (average Shannon entropy per column): 17.0 %
  • Patterns (unique columns)-over-taxa: 13.6 %
  • % Gaps: 2.5 %
  • Bollback: 2.3 %
  • Sites (n columns)-over-taxa: 1.5 %
  • % Invariant columns: 0.6 %
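
To make the alignment-based features in the list above concrete, here is a minimal Python sketch of how they might be computed from a character matrix. It is an illustration only, not Haag et al.'s C implementation: the function names, the dict-of-strings input format, the use of log base 2, and the treatment of `-` and `?` as gaps are all assumptions, and the authors' exact definitions may differ. The parsimony-based features (unique topologies, RF distance) and the Bollback statistic are left out because they need tree searches.

```python
import math
from collections import Counter

# Assumption: both "-" and "?" count as gap / missing data.
GAP_CHARS = {"-", "?"}


def column_entropy(column):
    """Shannon entropy (in bits) of the non-gap states in one column."""
    states = [s for s in column if s not in GAP_CHARS]
    if not states:
        return 0.0
    counts = Counter(states)
    total = len(states)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def alignment_features(alignment):
    """Compute simple dataset features.

    alignment: dict mapping taxon name -> sequence string (hypothetical
    input format; all sequences must have equal length).
    """
    seqs = list(alignment.values())
    n_taxa = len(seqs)
    n_sites = len(seqs[0])
    if any(len(s) != n_sites for s in seqs):
        raise ValueError("all sequences must have the same length")

    # Transpose the matrix so each string is one column (site).
    columns = ["".join(s[i] for s in seqs) for i in range(n_sites)]

    n_gaps = sum(c in GAP_CHARS for col in columns for c in col)
    # Assumption: a column with at most one non-gap state is invariant,
    # so all-gap columns are counted as invariant too.
    n_invariant = sum(len(set(col) - GAP_CHARS) <= 1 for col in columns)

    return {
        "entropy": sum(column_entropy(col) for col in columns) / n_sites,
        "patterns_over_taxa": len(set(columns)) / n_taxa,
        "sites_over_taxa": n_sites / n_taxa,
        "prop_gaps": n_gaps / (n_taxa * n_sites),
        "prop_invariant": n_invariant / n_sites,
    }
```

For example, `alignment_features({"a": "AAC-", "b": "AAG-", "c": "AAT?", "d": "AAT-"})` returns proportions rather than percentages; multiply `prop_gaps` and `prop_invariant` by 100 to compare with percentage-style features.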
