Pragmatike
Data Scientist / OCR specialist
Job Description
Job overview:
- Location: Fully remote, EU timezone (CET +/- 2hours)
- Start date: ASAP
- Languages: French is mandatory
- Experience: Minimum 5 years
- Our client: Saas building an MVP with OCR technology
Job Summary:
The ideal candidate will be responsible for designing, developing, and implementing OCR solutions to extract text and data from images and documents. This role requires expertise in OCR technologies and a strong programming background.
Job Responsibilities:
- Development of OCR Algorithms:
- Design and implement OCR algorithms to accurately extract text and data from various sources like images and scanned documents.
- Optimize algorithms for speed, accuracy, and reliability.
- Integration with Applications:
- Integrate OCR solutions with existing applications and systems to automate data extraction processes.
- Collaborate with software developers to ensure seamless integration of OCR functionalities.
- Selection of OCR Tools:
- Evaluate and choose appropriate OCR tools and libraries based on project requirements.
- Stay updated on the latest OCR technologies and tools in the industry.
- Training and Testing:
- Train OCR models using machine learning techniques to enhance recognition accuracy.
- Conduct thorough testing and validation of OCR solutions to ensure high precision and recall rates.
- Customization and Configuration:
- Customize OCR solutions to meet specific project requirements and handle various document formats.
- Configure OCR parameters for optimal performance in different scenarios.
- Performance Optimization:
- Optimize OCR solutions for performance, scalability, and resource efficiency.
- Implement parallel processing and other techniques to improve processing speed.
- Documentation:
- Create and maintain comprehensive documentation for OCR algorithms, configurations, and integration procedures.
- Provide documentation for troubleshooting and support purposes.
Qualifications:
- Bachelor degree in Computer Science, Software Engineering, or a related field.
- Proven experience in developing OCR solutions, including algorithm design and integration.
- Strong programming skills in languages like Python, Java, or C++.
- Knowledge of OCR libraries and frameworks such as Tesseract, ABBYY, Google Cloud Vision API.
- Understanding of machine learning concepts related to OCR is a plus.
- Familiarity with image processing techniques and formats.
- Excellent problem-solving and analytical skills.
- Strong communication and collaboration skills.