Wynd Labs
Web Scraping Specialist
NEWRemoteFull-timeGlobal
š Midš Remote
RemoteRemote work position availableActivePosted within the last 30 days
Job Description
[AI-summarized by JobStash]
You will design, build, and maintain robust web scraping pipelines to reliably extract data from complex websites. You will write, test, and refine scraping code, handle dynamic content and pagination, clean and format extracted data, and store it in efficient databases. You will monitor scraping runs, identify and fix failures, and scale jobs using cloud infrastructure and distributed techniques.
Requirements
- āDemonstrated ability to extract data from complex websites
- āProficiency in Python or JavaScript
- āExperience with BeautifulSoup, Scrapy, or Selenium
- āKnowledge of asynchronous programming, multithreading, and distributed scraping
- āIn-depth knowledge of HTML, CSS, JavaScript, and the DOM
- āExperience with NoSQL databases such as MongoDB or Cassandra
- āExperience with cloud services (AWS, Google Cloud, Azure)
- āAbility to apply machine learning for data cleaning or categorization
- āParticipation in open-source projects related to web scraping or data processing
Responsibilities
- āLead data gathering and analysis from online sources
- āWrite, test and refine code to extract data from websites
- āHandle pagination and dynamic content loaded via AJAX
- āClean and format extracted data to meet quality standards
- āStore and manage scraped data in appropriate databases
- āMonitor scraping processes and resolve operational issues
- āOptimize scraping processes for reliability and scale
Benefits & Perks
- āRemote work
- āEquity package
Tech Stack
AJAXCassandramultithreadingGoogle CloudCSSHTMLdatabase managementSeleniummachine learningdistributed scrapingproject:Wynd Network