ML Interns - Natural Language Processing: Document Understanding
<div id="about-job">
<div>
<div>
<p>
We are developing an AI Platform for the Architecture, Engineering, and Construction (AEC) industry. Our platform leverages advanced AI to enable construction domain experts to create complex use cases efficiently.
</p>
</div>
<h2>
Tasks
</h2>
<div>
<p>
We are looking for
<strong>
full-time interns (for min. 6 months)
</strong>
to work on cutting-edge machine learning problems and contribute to our product development.
</p>
<p>
You should have experience implementing machine learning models in PyTorch and be proficient in Python. Prior experience in document understanding, information extraction,
<strong>
OCR / OCR-free methods
</strong>
, or
<strong>
Retrieval Augmented Generation (RAG)
</strong>
with large language models is preferred.
</p>
<p>
The successful candidate will:
</p>
<ul>
<li>
Extract and pre-process data from
<strong>
transactional documents with varying layouts
</strong>
</li>
<li>
Collaborate with ML engineers on model design, experimentation, and implementation
</li>
<li>
Collaborate with ML engineers to design a system with state-of-the-art ML components that effectively addresses customer KPIs
</li>
<li>
Discuss requirements with customer-facing members to understand the problem and its constraints
</li>
<li>
Proactively propose and implement iterative improvements
</li>
<li>
Propose and implement metrics to evaluate relevant KPIs
</li>
<li>
Integrate the solutions into a common codebase and demonstrate good software engineering practices
</li>
<li>
Communicate results and analysis on a regular basis
</li>
</ul>
</div>
<h2>
Requirements
</h2>
<div>
<ul>
<li>
Currently enrolled in a PhD or Master's program in Computer Science or a related field at a German or EU university
</li>
<li>
Experience in implementing machine learning models in PyTorch, specifically Large Language Models
</li>
<li>
Experience with document understanding
</li>
<li>
Experience with Retrieval Augmented Generation (RAG)
</li>
<li>
Experience with OCR methods or OCR-free document understanding methods, e.g., Visual Document Question Answering (visual doc-QA) and Information Extraction
</li>
<li>
Proficiency in Python and good software engineering skills
</li>
<li>
Good communication and interpersonal skills
</li>
<li>
Ability to work in a team-oriented environment
</li>
<li>
Strong problem-solving skills
</li>
<li>
Fluent in English
</li>
<li>
<strong>
Eligible to work in Germany
</strong>
</li>
</ul>
</div>
<h2>
Benefits
</h2>
<div>
<p>
You will be a part of an
<strong>
inclusive start-up culture
</strong>
in a stimulating "work hard, play hard" environment. You will work with (and party with) great colleagues from diverse backgrounds. Team events and after-work activities are frequent at CONXAI. You will be empowered to bring new perspectives and create impact.
</p>
</div>
</div>
<div>
<div>
<div>
Updated: 4 days ago
</div>
<div>
Job ID: 13064929
</div>
</div>
</div>
<div>
<div>
<div>
<div>
</div>
<div>
<h2>
Conxai Technologies GmbH
</h2>
<div>
<div>
<div>
<i>
</i>
<div>
11-50 employees
</div>
</div>
<div>
<i>
</i>
<div>
Software Development
</div>
</div>
</div>
</div>
</div>
</div>
<div>
<div>
<p>
CONXAI is a deep tech start-up with a passion for revolutionizing the $12T global construction industry using Domain AI and Frontier Tech.
</p>
</div>
</div>
<div>
<a href="https://www.conxai.com">
<span>
<div>
<i>
</i>
<div>
Website
</div>
</div>
</span>
</a>
<a href="https://www.linkedin.com/company/conxai/">
<span>
<div>
<i>
</i>
<div>
LinkedIn
</div>
</div>
</span>
</a>
</div>
</div>
</div>
<div>
<h6>
<div>
Our other open positions
<div>
<a href="https://join.com/companies/conxai">
<span>
View all open positions
</span>
</a>
</div>
</div>
</h6>
<div>
<a href="https://join.com/companies/conxai/13047854-senior-qa-test-engineer-ai-b2b-saas">
<span>
<div>
<div>
<div>
<div>
</div>
</div>
<div>
<div>
<h3>
Senior QA Test Engineer (AI B2B SaaS)
</h3>
<div>
</div>
</div>
<div>
<div>
<i>
</i>
Gurgaon, India
</div>
<div>
<i>
</i>
Employee
</div>
<div>
<i>
</i>
Quality Assurance, Inspection
</div>
</div>
</div>
<div>
<span>
<i>
</i>
</span>
</div>
</div>
</div>
</span>
</a>
<a href="https://join.com/companies/conxai/12985441-sr-computer-vision-engineer-india">
<span>
<div>
<div>
<div>
<div>
</div>
</div>
<div>
<div>
<h3>
Sr. Computer Vision Engineer (India)
</h3>
<div>
</div>
</div>
<div>
<div>
<i>
</i>
Gurgaon, India
</div>
<div>
<i>
</i>
Employee
</div>
<div>
<i>
</i>
Project Management
</div>
</div>
</div>
<div>
<span>
<i>
</i>
</span>
</div>
</div>
</div>
</span>
</a>
<a href="https://join.com/companies/conxai/13043354-senior-backend-engineer-golang-aws-cloud-and-api-solutions">
<span>
<div>
<div>
<div>
<div>
</div>
</div>
<div>
<div>
<h3>
Senior Backend Engineer (Golang, AWS) - Cloud and API Solutions
</h3>
<div>
</div>
</div>
<div>
<div>
<i>
</i>
Gurgaon, India
</div>
<div>
<i>
</i>
Employee
</div>
<div>
<i>
</i>
Engineering
</div>
</div>
</div>
<div>
<span>
<i>
</i>
</span>
</div>
</div>
</div>
</span>
</a>
<a href="https://join.com/companies/conxai/13011095-data-science-intern">
<span>
<div>
<div>
<div>
<div>
</div>
</div>
<div>
<div>
<h3>
Data Science Intern
</h3>
<div>
</div>
</div>
<div>
<div>
<i>
</i>
Munich, Germany
</div>
<div>
<i>
</i>
Internship
</div>
<div>
<i>
</i>
Other
</div>
</div>
</div>
<div>
<span>
<i>
</i>
</span>
</div>
</div>
</div>
</span>
</a>
</div>
<a href="https://join.com/companies/conxai/spontaneous-application">
<span>
<div>
<div>
<div>
<div>
<div>
<div>
</div>
</div>
<div>
Spontaneous Application
</div>
</div>
<i>
</i>
</div>
</div>
</div>
</span>
</a>
</div>
</div>