Job Description
Roles & Responsibilities
AI/LLM Development & Deployment
Implement, manage, and optimize Large Language Models (LLMs), including those for specialized tasks like image processing and handwriting recognition.
Set up and deploy LLMs on cloud services, ensuring scalable, secure, and performant solutions.
Configure, deploy, and manage LLMs on local servers, ensuring seamless integration with existing on-premise infrastructure and data.
Fine-tune and train LLMs to deliver high-quality, accurate, and contextually relevant outputs tailored to specific business needs.
Conduct thorough troubleshooting and root cause analysis for issues encountered during AI model training, deployment, and operation.
Integrate AI models and local APIs with both external systems and internal applications, ensuring smooth data flow and functional synergy.
Design and develop AI models and solutions, leveraging OpenAI technologies where appropriate while also focusing on open-source AI frameworks such as LangChain.
AI Network & Infrastructure
Develop and implement robust platforms for the monitoring, analysis, and diagnosis of large-scale AI/LLM networks to ensure operational stability and performance.
Research, evaluate, and develop high-performance AI communication frameworks and network protocols to push the boundaries of AI capabilities.
Contribute to building the next-generation AI network infrastructure capable of supporting large-scale heterogeneous network hardware.
Desired Candidate Profile
Knowledge and Experience:
Bachelor’s degree in IT with 5 years’ experience in a similar function, or Master’s degree with 3 years’ experience, or as defined in the JD Matrix.
Advanced AI/ML & LLM Expertise:
Bachelor's/Master's in Computer Science/AI/ML, with a deep understanding of AI algorithms, data structures, deep learning, and NLP, and proven experience in developing and deploying Large Language Models (LLMs) using open-source (e.g., LangChain) and OpenAI frameworks.
Programming & Cloud Proficiency:
Expert in Python, with experience in .NET, TensorFlow, PyTorch, and Node.js, coupled with hands-on experience deploying AI/LLMs on AWS Cloud (UAE Region) and local servers.
Data Management & Engineering:
Proficient in pre-processing, cleaning, and augmenting large datasets for model training, along with a strong understanding of model evaluation metrics.
Problem-Solving & Collaboration:
A proactive approach to technical challenges, excellent analytical skills, and strong teamwork and communication abilities.