Pragmatike

Data Scientist / OCR specialist

26 October 2024
Apply Now
Deadline date:
£65000 - £122000 / year

Job Description

Job overview:

  • Location: Fully remote, EU timezone (CET +/- 2hours)
  • Start date: ASAP
  • Languages: French is mandatory
  • Experience: Minimum 5 years
  • Our client: Saas building an MVP with OCR technology

Job Summary:

The ideal candidate will be responsible for designing, developing, and implementing OCR solutions to extract text and data from images and documents. This role requires expertise in OCR technologies and a strong programming background.

Job Responsibilities:

  1. Development of OCR Algorithms:
    • Design and implement OCR algorithms to accurately extract text and data from various sources like images and scanned documents.
    • Optimize algorithms for speed, accuracy, and reliability.
  2. Integration with Applications:
    • Integrate OCR solutions with existing applications and systems to automate data extraction processes.
    • Collaborate with software developers to ensure seamless integration of OCR functionalities.
  3. Selection of OCR Tools:
    • Evaluate and choose appropriate OCR tools and libraries based on project requirements.
    • Stay updated on the latest OCR technologies and tools in the industry.
  4. Training and Testing:
    • Train OCR models using machine learning techniques to enhance recognition accuracy.
    • Conduct thorough testing and validation of OCR solutions to ensure high precision and recall rates.
  5. Customization and Configuration:
    • Customize OCR solutions to meet specific project requirements and handle various document formats.
    • Configure OCR parameters for optimal performance in different scenarios.
  6. Performance Optimization:
    • Optimize OCR solutions for performance, scalability, and resource efficiency.
    • Implement parallel processing and other techniques to improve processing speed.
  7. Documentation:
    • Create and maintain comprehensive documentation for OCR algorithms, configurations, and integration procedures.
    • Provide documentation for troubleshooting and support purposes.

Qualifications:

  • Bachelor degree in Computer Science, Software Engineering, or a related field.
  • Proven experience in developing OCR solutions, including algorithm design and integration.
  • Strong programming skills in languages like Python, Java, or C++.
  • Knowledge of OCR libraries and frameworks such as Tesseract, ABBYY, Google Cloud Vision API.
  • Understanding of machine learning concepts related to OCR is a plus.
  • Familiarity with image processing techniques and formats.
  • Excellent problem-solving and analytical skills.
  • Strong communication and collaboration skills.