[Remote] Gen AI / Machine Learning Engineer
Note: The job is a remote job and is open to candidates in USA. Precision Technologies is a company focused on cutting-edge technology solutions, and they are seeking a Generative AI / Machine Learning Engineer. The role involves designing, developing, and deploying machine learning and generative AI models for real-world applications while collaborating with cross-functional teams to ensure efficient integration into production systems.
Responsibilities
- Design, develop, and deploy machine learning and generative AI models for real-world applications
- Build and optimize solutions using large language models (LLMs) for tasks like text generation, summarization, and Q&A
- Develop and maintain end-to-end ML pipelines from data ingestion to model deployment
- Fine-tune pre-trained models using frameworks like Hugging Face Transformers
- Implement prompt engineering strategies to improve LLM outputs
- Work with vector databases for semantic search and retrieval-augmented generation (RAG)
- Integrate AI/ML models into scalable APIs and production systems
- Collaborate with data engineers to prepare and process large datasets
- Evaluate model performance and continuously improve accuracy and efficiency
- Deploy models using cloud platforms like AWS, Azure, or GCP
- Monitor models in production and handle model drift and retraining
- Ensure responsible AI practices including fairness, bias mitigation, and security
- Optimize models for latency, scalability, and cost
- Work closely with cross-functional teams including product managers and stakeholders
- Participate in Agile/Scrum development processes
Skills
- Design, develop, and deploy machine learning and generative AI models for real-world applications
- Build and optimize solutions using large language models (LLMs) for tasks like text generation, summarization, and Q&A
- Develop and maintain end-to-end ML pipelines from data ingestion to model deployment
- Fine-tune pre-trained models using frameworks like Hugging Face Transformers
- Implement prompt engineering strategies to improve LLM outputs
- Work with vector databases for semantic search and retrieval-augmented generation (RAG)
- Integrate AI/ML models into scalable APIs and production systems
- Collaborate with data engineers to prepare and process large datasets
- Evaluate model performance and continuously improve accuracy and efficiency
- Deploy models using cloud platforms like AWS, Azure, or GCP
- Monitor models in production and handle model drift and retraining
- Ensure responsible AI practices including fairness, bias mitigation, and security
- Optimize models for latency, scalability, and cost
- Work closely with cross-functional teams including product managers and stakeholders
- Participate in Agile/Scrum development processes
Benefits
- Location: United States — Onsite / Hybrid / Remote
- Employment Type: W2 · Full-time · All immigration statuses accepted
Company Overview
Company H1B Sponsorship
Apply To This Job