https://bayt.page.link/BJwbB2PMqSFuFgYAA
Create a job alert for similar positions

Job Description

Job Description

Engineer / Senior Engineer / Principal Engineer - Web scraping, Python - Pune, India

REF29681T 

"The very rapid development of e-commerce today gives access to thousands of information. These data are difficult to exploit by companies, which then have difficulty choosing the right levers of action and measuring their impact.This is where Data Impact by NielsenIQ comes in! Every day, we collect more than 60 billion pieces of information, process them and use them in innovative monitoring and action tools for professionals in the sector. Our goal: to give our customers and consumers real-time visibility into the market. Today: Data Impact by NielsenIQ is a leading start-up in the ‘Retail Analytics’ sector.


About the Job
In full growth, particularly internationally, we are looking for new collaborators to join our fabulous team! A young but experienced, dynamic and complementary team: a resolutely start-up spirit! Real job and career opportunities A friendly atmosphere and a climate of trust that promotes autonomy and challenge!"


Responsibilities: 


  • Responsible for the capture of massive data on the web and mobile terminals, and the design of architectures such as extraction, deduplication, classification, clustering, and filtering; 
  • Responsible for the design and development of distributed web crawlers, able to independently solve various problems encountered in the actual development process; 
  • Responsible for the research and development of web page information extraction technology algorithms to improve the efficiency and quality of data capture; 
  • Responsible for the analysis and warehousing of crawled data, monitoring of the crawler system and abnormal alarms; 
  • Responsible for designing and developing data collection strategies and anti-shielding rules to improve the efficiency and quality of data collection; 
  • Responsible for the design and development of core algorithms according to the system data processing flow and business function requirements; 

Qualifications

Must Haves:


  • Proficient in Python language, familiar with one or more of the commonly used crawler frameworks, such as Scrapy framework or other Web scraping frameworks, with independent development experience.
  • Have 1-15 years of experience
  • Familiar with vertical search crawlers and distributed web crawlers, deeply understand the principles of web crawlers, have rich experience in data crawling, parsing, cleaning, and storage related projects, and master anti-crawler technology and breakthrough solutions. 
  •  Master the basic operation of linux,
  •  Experience in distributed crawler architecture design, IP farms and proxy is preferred.
  •  A solid foundation in data structure and algorithms is preferred.

Good to have:


  • Familiar with common data storage and various data processing technologies are preferred.
  • Familiar with commonly used frameworks such as ssh, multi-threading, network communication programming related knowledge.
  • Familiar with at least one RDBMS and non-structure DB technologies.
  • Hands-on experience for crawling any eCommerce platform is a big plus.

Additional Information
  • Enjoy a flexible and rewarding work environment with peer-to-peer recognition platforms. 
  • Recharge and revitalize with help of wellness plans made for you and your family. 
  • Plan your future with financial wellness tools. 
  • Stay relevant and upskill yourself with career development opportunities. 

Our Benefits


  • Flexible working environment
  • Volunteer time off
  • LinkedIn Learning
  • Employee-Assistance-Program (EAP)

About NIQ


NIQ is the world’s leading consumer intelligence company, delivering the most complete understanding of consumer buying behavior and revealing new pathways to growth. In 2023, NIQ combined with GfK, bringing together the two industry leaders with unparalleled global reach. With a holistic retail read and the most comprehensive consumer insights—delivered with advanced analytics through state-of-the-art platforms—NIQ delivers the Full View™. NIQ is an Advent International portfolio company with operations in 100+ markets, covering more than 90% of the world’s population.


For more information, visit NIQ.com


Want to keep up with our latest updates?


Follow us on: LinkedIn | Instagram | Twitter | Facebook


Our commitment to Diversity, Equity, and Inclusion


NIQ is committed to reflecting the diversity of the clients, communities, and markets we measure within our own workforce. We exist to count everyone and are on a mission to systematically embed inclusion and diversity into all aspects of our workforce, measurement, and products. We enthusiastically invite candidates who share that mission to join us. We are proud to be an Equal Opportunity/Affirmative Action-Employer, making decisions without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability status, age, marital status, protected veteran status or any other protected class. Our global non-discrimination policy covers these protected classes in every market in which we do business worldwide. Learn more about how we are driving diversity and inclusion in everything we do by visiting the NIQ News Center: https://nielseniq.com/global/en/news-center/diversity-inclusion




You have reached your limit of 15 Job Alerts. To create a new Job Alert, delete one of your existing Job Alerts first.
Similar jobs alert created successfully. You can manage alerts in settings.
Similar jobs alert disabled successfully. You can manage alerts in settings.