Senior AI Engineer
Company: Krisp
Category: IT
Job Address: Yerevan, Armenia
Application Deadline: 28/03/2026
Responsibilities
- Own the end-to-end LLM pipeline, ensuring scalability, maintainability, and documentation
- Design and optimize LLM inference pipelines for production, ensuring scalability and reliability
- Profile and optimize model performance for speed, cost, and compute resource utilization
- Work with cloud-based AI services (AWS, GCP, Azure) to manage compute resources efficiently
- Monitor and log model performance, identifying areas for optimization
- Define and implement model evaluation metrics for tracking accuracy, latency, and cost efficiency
- Automate LLM testing (e.g., hallucination detection, bias monitoring, and robustness checks)
- Collaborate closely with the AI QA Engineer and Prompt Engineer to integrate testing, evaluation, and prompt design into the pipeline
Required Qualifications
- Strong Python and ML framework expertise (PyTorch, Hugging Face)
- Deep understanding of LLMs (GPT, Claude, Mistral, etc.) and prompt engineering methodologies
- Experience with vector databases and retrieval-augmented generation (RAG)
- Experience in profiling and optimizing LLM performance (latency, cost, memory usage)
- Strong knowledge of APIs and cloud AI infrastructure
- Familiarity with LLM optimization and deployment strategies is a plus
- Ability to document pipeline architecture and workflows for cross-team collaboration
Application Procedures
Apply here
https://krispai.notion.site/31392f5cd1bb819fb119d09e50cd2277?pvs=105
Please mention in your application that you learned about this position on MyJob.am