On MS Azure/AWS/GCP Cloud knowledge •Experience in development of Information extraction pipelines (OCR, NLP, text/layout segmentation) using Open-source tools/Models (ex. tesseract, PaddleOCR, etc.) or Deep Neural Network (Yolo, Detectron, etc) •Experience in fine tuning of pretrained models (pytorch or tensorflow) •Experience in Data Modeling and Semantic search using sparse and dense … analysis tools (e.g., SQL, R & Python), database concepts/reporting & Data Science concepts •Information extraction pipelines (OCR, NLP, text/layout segmentation) using Open-source tools/Models (ex. tesseract, PaddleOCR, etc.) or Deep Neural Network (Yolo, Detectron, etc) •Pretrained models (pytorch or tensorflow) •Data Modeling and Semantic search using sparse and dense indexes. LLM indexing tools and libraries (Langchain More ❯
data processing with machine translation, speech-to-text, and OCR tools. -Experience with installation, configuration, and optimization of Machine Translation software (Systran, Language Weaver, Microsoft Translator) and OCR software (Tesseract, AWS Textract). -Experience with AI/ML GPU technologies such as Nvidia. -Extensive knowledge in administering servers with embedded GPUs running on RedHat Linux. Benefits: -Flexible Paid Time Off More ❯
AWS Data Science Environment: Hands-on experience with SageMaker, Lambda, Step Functions, S3, Athena. Model deployment and pipeline orchestration in AWS. OCR Use-Case Development: Proficiency with Amazon Textract, Tesseract, and LLM-based OCR. Building document parsing pipelines, validations, and rules. Python Proficiency: Strong skills with libraries like pandas, NumPy, scikit-learn, PyTorch, and Hugging Face Transformers. Ability to write More ❯
Science Environment: Hands-on experience with Sage Maker, Lambda, Step Functions, S3, Athena. Model deployment and pipeline orchestration in AWS. OCR Use-Case Development: Proficiency with Amazon Tex-tract, Tesseract, and LLM-based OCR. Building document parsing pipelines, validations, and rule Python Proficiency: Strong coding skills using libraries like pandas, NumPy, scikit-learn, PyTorch, and Hugging Face Transformers. Writing clean More ❯
Milton Keynes, Buckinghamshire, South East, United Kingdom Hybrid / WFH Options
LA International Computer Consultants Ltd
Science Environment: Hands-on experience with Sage Maker, Lambda, Step Functions, S3, Athena. Model deployment and pipeline orchestration in AWS. OCR Use-Case Development: Proficiency with Amazon Tex-tract, Tesseract, and LLM-based OCR. Building document parsing pipelines, validations, and rule Python Proficiency: Strong coding skills using libraries like pandas, NumPy, scikit-learn, PyTorch, and Hugging Face Transformers. Writing clean More ❯
may involve out of hours working. To provide clear, well written technical documentation for a variety of purposes. To maintain accurate, real time records of site based interactions in Tesseract Service Centre providing customers and other G4S employees with all relevant information in a timely manner (call log numbers, progress updates etc.) To maintain accurate and timely records of all More ❯