py-realty 0.1.1

Creator: bradpython12

Last updated:

Add to Cart

Description:

pyrealty 0.1.1

PY-Realty
This is a library designed to scrape publicly available Real Estate listing information.
Specifically, all this library does is send the bare minimum/near bare minimum requests to the public web servers of supported listing websites. Meaning that is is effectively the same as visiting the website (such as Zillow) manually with a web browser with the exception that with PY-Realty you can avoid loading unnecessary JavaScript and media (images, etc.).
That being said scrapping large number of listings with no delays may result in the source (Zillow, Realtor.com, etc.) blocking your IP address (Zillow can probably figure out you're not manually looking at listings if you're querying thousands a minute.), although this has not been tested.
Status
VERSION: 0.1.0
Early development. The library is still in early development and is missing key features and may not be working entirely as expected.
This library is now somewhat usable (mileage may vary) and can be installed (and built) via pip, see installation instructions below.
Road Map

Zillow

Zillow Search Query

Implementation
Manual Testing
Unit Testing


Zillow Listing Details Parsing

Property Sale Listing

Implementation
Manual Testing
Unit Testing


Property Rental Listing

Implementation
Manual Testing
Unit Testing


Apartment Rental Listing

Implementation
Manual Testing
Unit Testing






Realtor.com

Realtor.com Sale Search Query

Implementation
Manual Testing
Unit Testing


Realtor.com Rental Search Query

Implementation
Manual Testing
Unit Testing


Property Sale Listing

Implementation
Manual Testing
Unit Testing


Property Rental Listing

Implementation
Manual Testing
Unit Testing




LandWatch

LandWatch search Query

Implementation
Manual Testing
Unit Testing


LandWatch Listing Details Parsing

Implementation
Manual Testing
Unit Testing




Apartments.com

Apartments.com Search Query

Implementation
Manual Testing
Unit Testing


Apartments.com Listing Details Parsing

Implementation
Manual Testing
Unit Testing





Regarding Unit Tests
Due to the nature of the data, it is impractical to have Unit tests check if data is extracted correctly. If testing against a source of Truth, say a listing, that listing may change over time. Additionally even though Zillow and Realtor.com may have similar data, if there is a mismatch it does not necessarily mean that scraping is done incorrectly, could just be a mismatch between the two websites.
Therefore Unit tests will check against unexpected behavior as opposed to extraction correctness. This meaning that they will check against exceptions and check if value types are correct. That being said the Unit Test criteria still needs to ensure reasonable robustness.
Usage
More detailed documentation will follow as development continues. However attributes and functions are well documented withing class and function doc-strings.
Zillow
Inside zillow the Query class is used to construct and send query request to Zillow's public API. Once the Query is configured as desired, the get_response method will return the search results.
The search results can then be scraped via the functions details.crapes_listing , details.scrape_listings, and details.lazy_scrape_listings.
Installation

Clone this repository
Within the repository folder, run pip install .

License

For personal and professional use. You cannot resell or redistribute these repositories in their original state.

Customer Reviews

There are no reviews.