PHP

Python

CSS

Javascript

HTML

Ruby

C++

icePick 0.0.5

Last updated: August 29, 2024

0 purchases

Free

Donate

Creator: rpa-with-ash

Languages

Python

Description:

icePick 0.0.5

IcePick is a All in one Package library for easy Scraping

Concept

Lightweight Scraping Library
All in one Package library for easy Scraping

Requirements

Python 3.4 or later(not support 2.x)
MongoDB

Dependencies Libraries

aiohttp
beautifulsoup4
pymongo >= 3.0
nose

Usage
Scraping Flow,
Your Scraping Order(Order) -> Do Scraping(Picker) -> HTML Parse(Parser) -> Save in Database(Recorder)

Example
get a my repository filenames
import icePick

db = icePick.get_database('icePick_example', 'localhost')

class GithubRepoParser(icePick.Parser):
def serialize(self):
result = {
"files": [],
}

for v in self.bs.find_all(class_="js-directory-link"):
result['files'] += [v.text]
return result

class GithubRepoRecorder(icePick.Recorder):
struct = icePick.Structure(files=list())

class Meta:
database = db

class GithubRepoOrder(icePick.Order):
recorder = GithubRepoRecorder
parser = GithubRepoParser

def main():
document = {
'url': 'https://github.com/teitei-tk/ice-pick/tree/master',
'ua': 'Mozilla/5.0 (Windows NT 6.3; Trident/7.0; rv:11.0) like Gecko',
}

print('---download start---')
order = GithubRepoOrder(document.get('url'), document.get('ua'))
picker = icePick.Picker([order])
picker.run()
print("---finish---")

if __name__ == "__main__":
main()
>>> import icePick
>>> db = icePick.get_database('icePick_example', 'localhost')
>>> class GithubRepoRecorder(icePick.Recorder):
... struct = icePick.Structure(files=list())
... class Meta:
... database = db
...
>>> records = GithubRepoRecorder.find()
>>> records[0].files
['example', 'icePick', 'tests', 'LICENSE', 'README.md', 'circle.yml', 'requirements.txt']
>>>

TODO

Crawling
Document

LICENSE

MIT

License:

For personal and professional use. You cannot resell or redistribute these repositories in their original state.

There are no reviews.

zed

icePick 0.0.5

Languages

Categories

Description:

License:

Share

Overview

What you can do with it

What you can't do with it

Related Products

Views For YouTube Bot writed on Python

AI-Web-Scraper

quivr

roop

More From This Creator

zzz

zyre

zug

zssh

zshdb

icePick 0.0.5

Languages

Categories

Description:

License:

Share

Customer Reviews

License

Overview

What you can do with it

What you can't do with it

Related Products

Views For YouTube Bot writed on Python

AI-Web-Scraper

quivr

roop

zed

More From This Creator

zzz

zyre

zug

zssh

zshdb