Senior LLM Engineer [up to 25k]

Salary undisclosed

Apply on

Original

Simplified

about the company

our client is an AI research and development studio focused on Local AI, small language models, and multi-modality technologies.

about the role

you will be fine-tuning and training LLM models.

about the job

fine-tune and train foundational LLM models using methods like PEFT, LoRA, and QLoRA
develop and manage LLM applications and infrastructure aligned with business requirements
design scalable LLM inference systems
explore and implement leading tools in the LLM ecosystem (e.g., Vector databases, LlamaIndex)
research emerging LLM use-cases (e.g. RAG, Agents, etc)
work closely with LLM research teams on foundation model projects aimed at productivity-related LLM solutions.

knowledge, skills and experience

background in working with LLMs, including well-known foundation models such as Llama2 and MPT
hands-on experience in training and fine-tuning foundational LLM models
proficiency with quantization methods like llama.cpp and GPTQ
expertise in LLM-related development, including tools like LlamaIndex, LangChain, Vector DBs, and Prompt engineering
experience deploying LLMs in production environments (e.g., Triton Inference Server) would be advantageous

how to apply

interested candidates may contact Hua Hui at +6017 960 0313 for a confidential discussion.

Similar Jobs

1d ago