House Hunter
Python package that automates property research by scraping listings and outputting structured data to Google Sheets.
Features
- Smart Web Scraping: Rate-limited requests respecting robots.txt with robust error handling
- NLP Processing: Extracts and standardizes information from varied listing formats
- Google Sheets Integration: Direct output to user-defined spreadsheet structures
- Location Intelligence: Google Distance Matrix API integration for commute time calculations
Tech Stack
- Scraping: Beautiful Soup, Requests, Selenium
- NLP: NLTK, spaCy
- Data: NumPy, Pandas
- APIs: Google Sheets API, Google Distance Matrix API