Senior LLM Engineer [up to 25k]
Salary undisclosed
Apply on
Original
Simplified
about the company
our client is an AI research and development studio focused on Local AI, small language models, and multi-modality technologies.
about the role
you will be fine-tuning and training LLM models.
about the job
- fine-tune and train foundational LLM models using methods like PEFT, LoRA, and QLoRA
- develop and manage LLM applications and infrastructure aligned with business requirements
- design scalable LLM inference systems
- explore and implement leading tools in the LLM ecosystem (e.g., Vector databases, LlamaIndex)
- research emerging LLM use-cases (e.g. RAG, Agents, etc)
- work closely with LLM research teams on foundation model projects aimed at productivity-related LLM solutions.
knowledge, skills and experience
- background in working with LLMs, including well-known foundation models such as Llama2 and MPT
- hands-on experience in training and fine-tuning foundational LLM models
- proficiency with quantization methods like llama.cpp and GPTQ
- expertise in LLM-related development, including tools like LlamaIndex, LangChain, Vector DBs, and Prompt engineering
- experience deploying LLMs in production environments (e.g., Triton Inference Server) would be advantageous
how to apply
interested candidates may contact Hua Hui at +6017 960 0313 for a confidential discussion. Similar Jobs