geospider 0.0.5
geo-spider
crawl all GEO metadata, features:
crawl platforms
crawl samples
crawl series
incremental crawling
missed crawling
Table of Contents
installation
output file format
logs
platforms
denovo crawling
incremental crawling
missed crawling
samples
denovo crawling
incremental crawling
missed crawling
series
denovo crawling
incremental crawling
missed crawling
installation
pip install geo-spider
output file format
geo-spider saves files in jsonlines form,
Refer to this site for details.
logs
geo-spider default generate logs to geo-spider.log(current directory)
in WARNING level, you can customize by -d and -l options.
-d to enable debug mode
-l specify customized log file
geo-spider -d -l new-geo-spider.log <sub-command>
platforms
platforms denovo crawling
geo-spider platforms -o platforms.jl
platforms incremental crawling
If you have a crawled platforms jsonlines file:
geo-spider platforms -cf platforms.jl -o new-platforms.jl
If you have multiple platforms jsonlines files:
geo-spider platforms -cd platforms -o new-platforms.jl
platforms missed crawling
Specify -cf or -cd like incremental crawling, add a -m option.
geo-spider platforms -cf platforms.jl -m missed -o new-platforms.jl
samples
samples denovo crawling
geo-spider samples -o samples.jl
samples incremental crawling
geo-spider samples -pcf platforms.jl -cf samples.jl -o new-samples.jl
samples missed crawling
geo-spider samples -pcf platforms.jl -cf samples.jl -m missed -o new-samples.jl
series
series denovo crawling
geo-spider series -o series.jl
series incremental crawling
geo-spider series -pcf platforms.jl -scf samples.jl -cf series.jl -o new-series.jl
series missed crawling
geo-spider series -pcf platforms.jl -scf samples.jl -cf series.jl -m missed -o new-series.jl
For personal and professional use. You cannot resell or redistribute these repositories in their original state.
There are no reviews.