parett 1.0.1

Creator: railscoder56

PAReTT
Python Automated Retrieval of TimeTree data
version 1.0.1, Windows OS
Introduction
PAReTT is a menu-driven module for interacting with the TimeTree resource, designed specifically
to automate batch retrieval of data for a list of species. Three main types of data can be retrieved using PAReTT:
(1) divergence times, either for an individual pair of species or between all species in a list; (2) evolutionary timelines, for an individual
species or for a list of species; and (3) time trees of the divergence times, either for all available species within a specified taxon or
for individual species supplied as a list.
When working with a list of species, the best place to start is the first menu option: check the data availability
of the species in the list and remove any species for which no data is available.
Dependencies:

Python >= 3.6
Numpy >= 1.20.1
Pandas >= 1.2.4
Bio >= 1.3.9 (for handling Newick trees)
Splinter >= 0.17.0 (for interacting with the server)
Selenium >= 4.1.5
Geckodriver >= 0.31.0 (added to PATH)
Firefox browser
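
Before the first run it can help to confirm that the dependencies listed above are visible to Python. The following is a minimal sketch, not part of PAReTT: the function name check_environment is hypothetical, the import names (for example Bio for the "Bio" entry) are assumptions, and the sketch only checks presence, not version numbers. Firefox is not checked because it is not necessarily on PATH.

```python
import importlib.util
import shutil
import sys

# Import names are assumptions derived from the dependency list above.
REQUIRED_MODULES = ("numpy", "pandas", "Bio", "splinter", "selenium")
# Geckodriver must be on PATH according to the dependency list.
REQUIRED_EXECUTABLES = ("geckodriver",)


def check_environment(modules=REQUIRED_MODULES,
                      executables=REQUIRED_EXECUTABLES,
                      min_python=(3, 6)):
    """Return a dict mapping each requirement to True (found) or False."""
    report = {"python": sys.version_info >= min_python}
    for name in modules:
        # find_spec returns None when a top-level module cannot be imported.
        report[name] = importlib.util.find_spec(name) is not None
    for exe in executables:
        # shutil.which returns None when the executable is not on PATH.
        report[exe] = shutil.which(exe) is not None
    return report


if __name__ == "__main__":
    for requirement, found in check_environment().items():
        print(f"{requirement:12s} {'OK' if found else 'MISSING'}")
```

Anything reported as MISSING would need to be installed (or added to PATH) before PAReTT can drive the browser.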

Installation:
After downloading and extracting the zip archive, PAReTT can be run by navigating to the extracted directory and using one of two methods:
python parett.py

or
python setup.py install
python -m parett

-> The second method installs the relevant dependencies automatically.
A pre-compiled stand-alone Windows executable is also available.
Main menu
The following options are available through the main menu:
MAIN MENU:
----------------------------------------
Choose one of the following options?
*) Check data availability
a) Get Divergence Times (pair)
b) Get Divergence Times (batch)
c) Get Evolutionary Timeline
d) Build a Time Tree
e) Print citation
f) Validate datafile
q) Quit
----------------------------------------

Input is given in lower case as '*', 'a', 'b', 'c', 'd', 'e', 'f', or 'q', e.g.
Choice: a

*) Check data availability
Brings up the availability menu, used to check the TimeTree.org website for divergence time data on your study species before retrieving anything.
AVAILABILITY MENU:
----------------------------------------
Choose one of the following options?
i) Individual
l) List
m) Main menu
----------------------------------------

Input is given in lower case as 'i', 'l', or 'm' (return to main menu).
i) Individual

Takes an individual species as input to look up data availability e.g. Passer montanus
Prints availability on screen

l) List

Takes a list of species as input from a .txt input file e.g. Species.txt
Prints availability on screen
Provides option to save results to a file in .csv format e.g. Availability.csv

e.g.

Species              TimeTree.Data
Setophaga ruticilla  Available
Hirundo rustica      Available
Setophaga striata    Available
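
Once the availability results are saved, the species list can be trimmed to those with data before any batch retrieval. The sketch below is illustrative only and is not PAReTT code. It assumes the saved file is comma-separated with the Species and TimeTree.Data columns shown above; the "Not available" label and the placeholder species "Fictus exemplaris" are invented for the example, as is the function name available_species.

```python
import csv
import io

# Example content shaped like the availability output shown above.
# The last row is a made-up placeholder to show a species being dropped.
EXAMPLE_CSV = """Species,TimeTree.Data
Setophaga ruticilla,Available
Hirundo rustica,Available
Fictus exemplaris,Not available
"""


def available_species(csv_file, species_col="Species",
                      status_col="TimeTree.Data", ok_value="Available"):
    """Return the species whose status equals ok_value, in file order."""
    reader = csv.DictReader(csv_file)
    return [row[species_col].strip() for row in reader
            if row[status_col].strip() == ok_value]


kept = available_species(io.StringIO(EXAMPLE_CSV))
# One name per line; whether PAReTT expects exactly this layout in the
# .txt list is an assumption.
print("\n".join(kept))
```

With a real file, io.StringIO(EXAMPLE_CSV) would be replaced by an open file handle, for example open("Availability.csv", newline="").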

a) Get Divergence Times (pair)

Takes a pair of species as input to look up divergence times e.g. Taxon a: Passer montanus, Taxon b: Halcyon senegalensis
Prints divergence time of pair on screen

b) Get Divergence Times (batch)

Takes a list of species as input from a .txt input file to look up divergence times e.g. Species.txt
Prints the divergence time of each pair on screen
Provides option to save results to a file in .csv format e.g. Output.csv

e.g.

Taxa1                Taxa2                Div.Time
Setophaga ruticilla  Setophaga ruticilla  0
Setophaga ruticilla  Hirundo rustica      35
Setophaga ruticilla  Setophaga striata    3.52
Hirundo rustica      Setophaga ruticilla  35
Hirundo rustica      Hirundo rustica      0
Hirundo rustica      Setophaga striata    35
Setophaga striata    Setophaga ruticilla  3.52
Setophaga striata    Hirundo rustica      35
Setophaga striata    Setophaga striata    0

When retrieving data for longer lists (more than about 5-10 species), server issues may result in missing values (NA). These can be checked and replaced using the data validation menu option (f) after the run.
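
The batch output is in long format, with one row per ordered pair, so n species give n x n rows. For downstream analysis a square matrix is often handier. The following is a minimal sketch of that reshaping using only the standard library; it is not part of PAReTT, the function names to_matrix and asymmetric_pairs are hypothetical, and it assumes a comma-separated file with the Taxa1, Taxa2 and Div.Time columns shown above.

```python
import csv
import io

# Example rows copied from the batch output shown above.
EXAMPLE_CSV = """Taxa1,Taxa2,Div.Time
Setophaga ruticilla,Setophaga ruticilla,0
Setophaga ruticilla,Hirundo rustica,35
Setophaga ruticilla,Setophaga striata,3.52
Hirundo rustica,Setophaga ruticilla,35
Hirundo rustica,Hirundo rustica,0
Hirundo rustica,Setophaga striata,35
Setophaga striata,Setophaga ruticilla,3.52
Setophaga striata,Hirundo rustica,35
Setophaga striata,Setophaga striata,0
"""


def to_matrix(csv_file):
    """Return a nested dict: matrix[taxon1][taxon2] -> float, or None for NA."""
    matrix = {}
    for row in csv.DictReader(csv_file):
        raw = row["Div.Time"].strip()
        value = None if raw in ("", "NA") else float(raw)
        matrix.setdefault(row["Taxa1"], {})[row["Taxa2"]] = value
    return matrix


def asymmetric_pairs(matrix):
    """List (a, b) pairs whose value differs from the reverse pair (b, a)."""
    return [(a, b) for a in matrix for b in matrix[a]
            if a < b and matrix[a][b] != matrix.get(b, {}).get(a)]


matrix = to_matrix(io.StringIO(EXAMPLE_CSV))
print(matrix["Setophaga ruticilla"]["Setophaga striata"])
print(asymmetric_pairs(matrix))
```

Because every pair appears in both directions, a pair flagged by asymmetric_pairs (for instance an NA on one side only) is a quick hint that the run should be validated.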

c) Get Evolutionary Timeline
Brings up the menu options to retrieve the evolutionary timeline:
TIMELINE MENU:
----------------------------------------
Choose one of the following options?
i) Individual
l) List
m) Main menu
----------------------------------------

Input is given in lower case as 'i', 'l', or 'm' (return to main menu).
i) Individual

Takes an individual species as input to look up its evolutionary timeline e.g. Passer montanus
Downloads the result as a .jpg file

l) List

Takes a list of species as input from a .txt input file e.g. Species.txt
Downloads a .jpg result for each species in the list

d) Build a Time Tree
Brings up the time tree menu options
TIME TREE MENU:
----------------------------------------
Choose one of the following options?
t) Taxon
s) Species list
m) Main menu
----------------------------------------

Input is given in lower case as 't', 's', or 'm' (return to main menu).
t) Taxon

Takes the name of a taxon to get a time tree of all available species within the taxon e.g. Saxicola

s) Species list

Takes a list of species as input from a .txt input file to generate a time tree e.g. Species.txt
Downloads the resulting time tree in Newick format
Stores replaced or missing species in a .txt file e.g. replacements.txt
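
To double-check which requested species actually ended up in a downloaded tree, the leaf labels of the Newick string can be compared with the input list. The sketch below is an independent illustration, not the way PAReTT builds replacements.txt. It assumes unquoted leaf labels with underscores in place of spaces; the function names are hypothetical and the example tree is made up from the divergence times used elsewhere in this document.

```python
import re

# A leaf label follows "(" or "," and is followed by ":", "," or ")".
# Quoted labels and bracketed comments are not handled in this sketch.
_LEAF = re.compile(r"(?<=[(,])\s*([^():,;\s][^():,;]*?)\s*(?=[:,)])")


def leaf_names(newick):
    """Return leaf labels in tree order, with underscores turned into spaces."""
    return [name.replace("_", " ") for name in _LEAF.findall(newick)]


def missing_from_tree(requested, newick):
    """Return the requested species that do not occur as leaves in the tree."""
    present = set(leaf_names(newick))
    return [species for species in requested if species not in present]


# Illustrative tree only (not real TimeTree output).
EXAMPLE_TREE = ("((Setophaga_ruticilla:3.52,Setophaga_striata:3.52):31.48,"
                "Hirundo_rustica:35.0);")

requested = ["Setophaga ruticilla", "Hirundo rustica", "Passer montanus"]
print(leaf_names(EXAMPLE_TREE))
print(missing_from_tree(requested, EXAMPLE_TREE))
```

A regular expression is enough here because only the leaf labels are needed; anything that depends on topology or branch lengths is better handled with a proper Newick parser.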

e) Print citation
Prints the citation for the TimeTree resource

S. Kumar, G. Stecher, M. Suleski, and S.B. Hedges, 2017. TimeTree: a resource for timelines, timetrees, and divergence times. Molecular Biology and Evolution 34: 1812-1819, DOI: 10.1093/molbev/msx116

f) Validate datafile
Brings up the datafile validation menu options
VALIDATE MENU:
----------------------------------------
a) Check missing
b) Replace missing
c) View tree
m) Main menu
----------------------------------------

Input is given in lower case as 'a', 'b', 'c', or 'm' (return to main menu).
a) Check missing

Used to check for missing values after running a long list of species (>10 species)
Takes the output file (.csv) from the divergence time function and checks for any missing values

e.g.

Taxa1                Taxa2                Div.Time
Setophaga ruticilla  Setophaga ruticilla  0
Setophaga ruticilla  Hirundo rustica      NA
Setophaga ruticilla  Setophaga striata    3.52
Hirundo rustica      Setophaga ruticilla  35
Hirundo rustica      Hirundo rustica      0
Hirundo rustica      Setophaga striata    NA
Setophaga striata    Setophaga ruticilla  3.52
Setophaga striata    Hirundo rustica      35
Setophaga striata    Setophaga striata    0

If no missing values are detected, prints 'No missing values'
If missing values are detected, they are printed to the screen and an attempt is made to look up those values
Asks for a file name to store the retrieved missing values as a .csv file e.g. missing.csv

e.g.

Taxa1                Taxa2                Div.Time
Setophaga ruticilla  Hirundo rustica      35
Hirundo rustica      Setophaga striata    35
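
The detection step can be pictured as a scan for NA entries in the Div.Time column. The following is a rough sketch under that assumption, not PAReTT's actual implementation; the function name missing_pairs is hypothetical, and the follow-up lookup against TimeTree is left out.

```python
import csv
import io

# Example rows copied from the first table above, including the two NA entries.
EXAMPLE_CSV = """Taxa1,Taxa2,Div.Time
Setophaga ruticilla,Setophaga ruticilla,0
Setophaga ruticilla,Hirundo rustica,NA
Setophaga ruticilla,Setophaga striata,3.52
Hirundo rustica,Setophaga ruticilla,35
Hirundo rustica,Hirundo rustica,0
Hirundo rustica,Setophaga striata,NA
Setophaga striata,Setophaga ruticilla,3.52
Setophaga striata,Hirundo rustica,35
Setophaga striata,Setophaga striata,0
"""


def missing_pairs(csv_file, na_values=("", "NA")):
    """Return (Taxa1, Taxa2) for every row whose Div.Time is empty or NA."""
    return [(row["Taxa1"], row["Taxa2"])
            for row in csv.DictReader(csv_file)
            if row["Div.Time"].strip() in na_values]


pairs = missing_pairs(io.StringIO(EXAMPLE_CSV))
if pairs:
    for taxon1, taxon2 in pairs:
        print(f"Missing: {taxon1} / {taxon2}")
else:
    print("No missing values")
```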

b) Replace missing

Used to replace the missing values (divergence times) from a long list of species
Takes two input files: one with the divergence times and one with the missing values retrieved using 'Check missing'
Asks for a file name to store the validated dataset of divergence times

e.g.

Taxa1                Taxa2                Div.Time
Setophaga ruticilla  Setophaga ruticilla  0
Setophaga ruticilla  Hirundo rustica      35
Setophaga ruticilla  Setophaga striata    3.52
Hirundo rustica      Setophaga ruticilla  35
Hirundo rustica      Hirundo rustica      0
Hirundo rustica      Setophaga striata    35
Setophaga striata    Setophaga ruticilla  3.52
Setophaga striata    Hirundo rustica      35
Setophaga striata    Setophaga striata    0
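
Conceptually, the replacement is a keyed merge: each NA row in the divergence file is looked up by its (Taxa1, Taxa2) pair in the missing-values file. The sketch below shows that idea on a shortened example; it is an assumption about the logic rather than PAReTT's code, and the function names fill_missing and to_csv_text are hypothetical.

```python
import csv
import io

# Shortened example inputs in the same layout as the tables above.
DIVERGENCE_CSV = """Taxa1,Taxa2,Div.Time
Setophaga ruticilla,Hirundo rustica,NA
Setophaga ruticilla,Setophaga striata,3.52
Hirundo rustica,Setophaga striata,NA
"""
MISSING_CSV = """Taxa1,Taxa2,Div.Time
Setophaga ruticilla,Hirundo rustica,35
Hirundo rustica,Setophaga striata,35
"""


def fill_missing(main_file, missing_file, na_values=("", "NA")):
    """Return the main rows with NA times replaced where a value is known."""
    lookup = {(r["Taxa1"], r["Taxa2"]): r["Div.Time"]
              for r in csv.DictReader(missing_file)}
    filled = []
    for row in csv.DictReader(main_file):
        key = (row["Taxa1"], row["Taxa2"])
        if row["Div.Time"].strip() in na_values and key in lookup:
            row["Div.Time"] = lookup[key]
        filled.append(row)
    return filled


def to_csv_text(rows):
    """Serialise rows back to CSV text with the original column order."""
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=["Taxa1", "Taxa2", "Div.Time"],
                            lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return out.getvalue()


filled = fill_missing(io.StringIO(DIVERGENCE_CSV), io.StringIO(MISSING_CSV))
print(to_csv_text(filled))
```

Rows whose pair is absent from the missing-values file keep their NA, so a second validation pass would still catch them.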

c) View tree

Takes a Newick tree as input and renders a basic display of the tree topology

q) Quit
Exits program

License

For personal and professional use. You cannot resell or redistribute these repositories in their original state.
