We are seeking a highly skilled Machine Learning Engineer to enhance and optimize our data extraction pipeline for commercial real estate lease processing.
This role focuses on fine-tuning text classification models, improving training datasets, and working with large volumes of unstructured text data.
The ideal candidate has experience with natural language processing (NLP), model retraining workflows, and cloud-based ML deployment.
Responsibilities include improving and maintaining the data extraction pipeline used for lease document processing.
The candidate will fine-tune and retrain existing ML models for text categorization, currently using TF-IDF and Scikit-learn.
The role involves owning the QA process for ML outputs and continuously optimizing model performance.
The candidate will enhance training datasets to improve model generalization and accuracy.
Collaboration with the team is required to ensure consistent extraction of 15β30 provisions per lease document.
The role includes working with OCR and NLP tools to refine document parsing and classification.
Requirements:
Proven experience in machine learning, with a focus on text classification and document processing is required.
Strong proficiency in Python and core NLP libraries such as spaCy, NLTK, scikit-learn, and transformers is necessary.
Experience with TF-IDF vectorization and traditional ML techniques for text classification is essential.
Familiarity with OCR technologies and PDF parsing tools, such as Marker, is required.
Experience deploying models on AWS and working with APIs like OpenAI (via Azure) and Claude (via Bedrock) is necessary.
Excellent problem-solving skills, attention to detail, and the ability to work independently are required.
English proficiency at the B2-C1 level is necessary.
Benefits:
A stable, long-term contract with opportunities for career growth is offered.
The company promotes a remote-friendly culture that supports work-life balance.
Continuous training, mentorship, and learning programs are provided to keep you at the forefront of the industry.
Free access to AI training resources and state-of-the-art AI tools is available to elevate your daily work.
A flexible Paid Time Off (PTO) policy as well as paid holiday days is included.
The position involves challenging, world-class software projects for clients in the US and LatAm.
Collaboration with some of the most talented software engineers in Latin America and the US, in a diverse work environment, is encouraged.