Web Scraping Python Backend Engineer
doodleblue innovations
Job Description
Greetings from Doodleblue Innovations!!
We’re hiring a Web Scraping Python Backend Engineer.
Location: Mumbai
Experience: 3 - 7 years
Notice period: Immediate Joiners
Role Overview
We are looking for an engineer with strong experience in web scraping and data extraction to build systems that collect legal data from various public websites. The role involves building reliable crawlers that extract court judgments, tribunal orders, and regulatory decisions and store them in structured form. You will work closely with the leadership to build the core data acquisition infrastructure of the platform.
Roles and Responsibilities
- Design and build crawlers to extract data from websites.
- Crawl listing pages and extract case metadata.
- Download judgment PDFs and maintain structured storage.
- Build automated pipelines to monitor websites and detect new judgments.
- Extract structured data such as case title, case number, court/bench, date, and judgment document.
- Store scraped data in formats suitable for further processing and search.
Required Skills
- Strong experience with Python.
- Experience with web scraping and crawler development.
- Familiarity with browser automation and crawling tools such as Playwright or Scrapy.
- PDF data extraction with pdfplumber, PyMuPDF, Apache Tika, or equivalent.
- Strong understanding of HTML parsing, pagination handling, and file downloads.
- Knowledge of anti-bot techniques: rate limiting, session rotation, and proxy management.
Preferred Skills
- Experience with large-scale crawlers.
- Experience working with document datasets.
- Familiarity with PDF extraction tools such as Apache Tika.
- AWS S3 for storing and managing large volumes of raw documents.
- Exposure to search systems such as Elasticsearch.
- Experience with AWS MSK / Kafka for event-driven pipelines.
Experience
3–7 years of experience in backend or data engineering roles. Prior experience building web crawlers or scraping systems is highly preferred.
Interested candidates can send their resumes to supriyar@doodleblue.com
Job Types: Full-time, Permanent
Pay: Up to ₹1,200,000.00 per year
Application Question(s):
- How many years of experience do you have in web scraping?
- How many years of experience do you have in Python?
- How many years of experience do you have as a Web Scraping Backend Engineer?
Experience
- Web Scraping: 3 years (Required)
- Python: 3 years (Required)
Work Location: In person