crawl_trulia 0.0.4
Welcome to crawl_trulia Documentation
This is a small project that provides URL routing and HTML parsing tools for crawling www.trulia.com.
Quick Links
GitHub Homepage
PyPI download
Install
Issue submit and feature request
Usage
A real example:
>>> from crawl_trulia.urlencoder import urlencoder
>>> from crawl_trulia.htmlparser import htmlparser
>>> from crawlib.spider import spider # install crawlib first
# use address, city and zipcode
>>> address = "22 Yew Rd"
>>> city = "Baltimore"
>>> zipcode = "21221"
>>> url = urlencoder.by_address_city_and_zipcode(address, city, zipcode)
>>> html = spider.get_html(url)
>>> house_detail_data = htmlparser.get_house_detail(html)
>>> house_detail_data
{
    "features": {},
    "public_records": {
        "AC": "a/c",
        "basement_type": "improved basement (finished)",
        "bathroom": 2,
        "build_year": 1986,
        "county": "baltimore county",
        "exterior_walls": "siding (alum/vinyl)",
        "heating": "heat pump",
        "lot_size": 7505,
        "lot_size_unit": "sqft",
        "partial_bathroom": 1,
        "roof": "composition shingle",
        "sqft": 998
    }
}
# usually the combination of address and zipcode is enough
>>> address = "2004 Birch Rd"
>>> zipcode = "21221"
>>> url = urlencoder.by_address_and_zipcode(address, zipcode)
>>> html = spider.get_html(url)
>>> house_detail_data = htmlparser.get_house_detail(html)
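Since `get_house_detail()` returns a plain Python dict (as shown above), the result can be post-processed with ordinary dict access. A minimal sketch using the sample record from the first example; the `summarize` helper is hypothetical, not part of crawl_trulia:

```python
# Sample record as returned by htmlparser.get_house_detail() (from the example above)
house_detail_data = {
    "features": {},
    "public_records": {
        "AC": "a/c",
        "basement_type": "improved basement (finished)",
        "bathroom": 2,
        "build_year": 1986,
        "county": "baltimore county",
        "exterior_walls": "siding (alum/vinyl)",
        "heating": "heat pump",
        "lot_size": 7505,
        "lot_size_unit": "sqft",
        "partial_bathroom": 1,
        "roof": "composition shingle",
        "sqft": 998,
    },
}

def summarize(record):
    """Pull a few commonly used fields out of a house detail dict.

    Uses .get() throughout, since fields may be missing for some listings.
    """
    pr = record.get("public_records", {})
    return {
        "build_year": pr.get("build_year"),
        "sqft": pr.get("sqft"),
        # count a partial bathroom as a half bath
        "bathrooms": pr.get("bathroom", 0) + 0.5 * pr.get("partial_bathroom", 0),
    }

summary = summarize(house_detail_data)
print(summary)  # {'build_year': 1986, 'sqft': 998, 'bathrooms': 2.5}
```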
Install
crawl_trulia is released on PyPI, so all you need is:
$ pip install crawl_trulia
To upgrade to the latest version:
$ pip install --upgrade crawl_trulia