geo-spider 0.0.5

Creator: bradpython12

Last updated:

Add to Cart

Description:

geospider 0.0.5

geo-spider
crawl all GEO metadata, features:

crawl platforms
crawl samples
crawl series
incremental crawling
missed crawling

Table of Contents

installation
output file format
logs
platforms

denovo crawling
incremental crawling
missed crawling


samples

denovo crawling
incremental crawling
missed crawling


series

denovo crawling
incremental crawling
missed crawling



installation
pip install geo-spider

output file format
geo-spider saves files in jsonlines form,
Refer to this site for details.
logs
geo-spider default generate logs to geo-spider.log(current directory)
in WARNING level, you can customize by -d and -l options.

-d to enable debug mode
-l specify customized log file

geo-spider -d -l new-geo-spider.log <sub-command>

platforms
platforms denovo crawling
geo-spider platforms -o platforms.jl

platforms incremental crawling
If you have a crawled platforms jsonlines file:
geo-spider platforms -cf platforms.jl -o new-platforms.jl

If you have multiple platforms jsonlines files:
geo-spider platforms -cd platforms -o new-platforms.jl

platforms missed crawling
Specify -cf or -cd like incremental crawling, add a -m option.
geo-spider platforms -cf platforms.jl -m missed -o new-platforms.jl

samples
samples denovo crawling
geo-spider samples -o samples.jl

samples incremental crawling
geo-spider samples -pcf platforms.jl -cf samples.jl -o new-samples.jl

samples missed crawling
geo-spider samples -pcf platforms.jl -cf samples.jl -m missed -o new-samples.jl

series
series denovo crawling
geo-spider series -o series.jl

series incremental crawling
geo-spider series -pcf platforms.jl -scf samples.jl -cf series.jl -o new-series.jl

series missed crawling
geo-spider series -pcf platforms.jl -scf samples.jl -cf series.jl -m missed -o new-series.jl

License

For personal and professional use. You cannot resell or redistribute these repositories in their original state.

Customer Reviews

There are no reviews.