Remote Machine Learning Engineer - Data Scrapping

Posted

Apply now
Please, let Tractian know you found this job on RemoteYeah. This helps us grow 🌱.

Description:

  • The Data Science team at TRACTIAN focuses on extracting valuable insights from vast amounts of industrial data.
  • This team uses advanced statistical methods, algorithms, and data visualization techniques to transform raw data into actionable intelligence.
  • The role involves building comprehensive and diverse datasets, including industrial equipment documentation and sensor data like vibration and temperature.
  • Responsibilities include designing and maintaining robust data collection pipelines from various sources, extracting and structuring information from unstructured formats, and handling real-world data challenges.
  • The position requires cleaning, filtering, and validating raw data to ensure high quality and usability.
  • The role also involves developing tools to support data collection workflows and collaborating with engineering and product teams to optimize data storage and access patterns.
  • Documentation of data sources, collection methodologies, and processing procedures is essential for reproducibility.

Requirements:

  • Candidates should have 0–2 years of experience in software development, data engineering, or related fields.
  • A degree in Computer Science, Computer Engineering, Information Systems, or an equivalent technical background is required.
  • Understanding of HTML, CSS selectors, and web page structure is necessary.
  • Strong problem-solving skills and attention to detail are essential.
  • The ability to work in a fast-paced environment and manage shifting priorities is required.
  • Proficiency in Python for data manipulation and automation is necessary.
  • Experience with data extraction tools like requests and BeautifulSoup is expected.
  • Familiarity with REST APIs and the HTTP protocol is required.
  • Experience with data cleaning techniques, including handling missing values and standardizing formats, is necessary.
  • Optional skills include exposure to browser automation tools like Selenium or Playwright.

Benefits:

  • The position offers the opportunity to work in a dynamic and innovative environment focused on data-driven solutions.
  • Employees will have the chance to enhance their skills in data engineering and machine learning applications.
  • The role supports remote work, providing flexibility in the work environment.
  • Team members will collaborate with cross-functional teams, gaining exposure to various aspects of product development and engineering.
  • The company promotes a culture of continuous learning and professional growth.
Apply now
Please, let Tractian know you found this job on RemoteYeah . This helps us grow 🌱.
About the job
Posted on
Job type
Salary
-
Report this job

Job expired or something else is wrong with this job?

Report this job
Leave a feedback