Sr AI Engineer

Paramus, NJ
Date Posted:20-May-2026
Work Type:On-Site
Job Number:485683

Job Description

Job Title: Senior AI Engineer – Google AI & Generative Intelligence
Duration: 6 Months (Temp-to-Hire)
Location: Paramus, NJ [Hybrid]
 
Role Overview
We are seeking a highly experienced Senior AI Engineer with deep expertise in Google AI technologies, Generative AI. The ideal candidate brings 10–15 years of broad software engineering experience, with the last 4+ years focused exclusively on Artificial Generative Intelligence, including designing, building, deploying, and monitoring production-grade AI systems. This role demands mastery of the Google ecosystem — including Google Workspace, Google Agent Development Kit (ADK), and Vertex AI — alongside a strong command of modern LLM/SLM frameworks, cloud-native infrastructure, and MLOps best practices.
 
Key Responsibilities
1. Large & Small Language Model Engineering

  • Design, develop, and deploy Agents leveraging commercial LLMs such as Gemini (Google), GPT (OpenAI), and Claude Sonnet (Anthropic) for high-performance, large-context, and multimodal tasks.
  • Work with open-source/self-hosted LLMs including Mixtral (Mistral AI).
  • Architect and implement SLM-based solutions using lightweight models such as Phi-3 (Microsoft), Gemma (Google), and Mistral for resource-constrained environments.
  • Lead fine-tuning and customization of models using Vertex AI Tuning, Hugging Face Transformers, and parameter-efficient fine-tuning (PEFT) methods including LoRA and QLoRA.
  • Apply training frameworks such as PyTorch, TensorFlow, or JAX for model experimentation and development.
  • Generate synthetic data and evaluate models using HELM, lm-evaluation-harness, and custom benchmarks.
 
2. Google AI & Workspace Integration
  • Lead the design and implementation of AI-powered solutions deeply integrated with Google Workspace (Docs, Sheets, Drive, Gmail, Meet), Big Query and Lakehouse.
  • Architect and build intelligent agents and workflows using Google Agent Development Kit (ADK).
  • Leverage Google AI Studio as the primary IDE, VSCode for AI application development and prototyping.
  • Utilize Google Cloud Platform (GCP) services including:
  • Vertex AI for ML model training, tuning, and deployment
  • GKE (Google Kubernetes Engine) for container orchestration
  • Cloud Run for serverless deployment
  • Cloud Functions for event-driven AI tasks
  • Vertex AI Vector DBs for semantic search and retrieval
 
3. Design & Planning
  • Lead requirements gathering using Confluence for documentation and team collaboration.
  • Create detailed system architecture diagrams and AI workflows using Lucidchart.
  • Design UI/UX prototypes in Figma for AI-powered application interfaces.
  • Manage project delivery and sprint planning using Jira.
  • Oversee data preparation and management: cleaning, transforming, and organizing data for AI/ML workflows.
  • Conduct data analysis using Jupyter Notebooks and pandas for exploration and preprocessing.
  • Leverage Hugging Face Model Hub for model comparison, selection, and download.
 
4. Development Frameworks & Tools
  • Orchestrate LLM/SLM applications using LangChain, LlamaIndex, and LangGraph.
  • Build multi-agent systems with Semantic Kernel, and LangGraph.
  • Manage and optimize prompts using LangSmith and PromptLayer.
  • Deploy models locally with Ollama or at scale with vLLM for efficient inference.
  • Track experiments, metrics, and results with MLflow or Weights & Biases.
  • Manage code and data versioning with Git.
 
5. Vector Databases & Semantic Search
  • Implement semantic search and Retrieval-Augmented Generation (RAG) pipelines using Vertex AI Vector DBs and ChromaDB.
  • Design and optimize end-to-end RAG architectures for enterprise-grade knowledge retrieval.
 
6. Backend Development
  • Develop robust RESTful APIs using FastAPI (Python) or Express.js (Node.js).
  • Manage and secure APIs using Mulesoft, Apigee.
 
7. Frontend Development
  • Build modern user interfaces using React or Angular.
  • Utilize Material-UI for consistent, accessible, and modern UI components.
  • Prototype and plan UI/UX workflows using Figma.
 
8. Development Tools & Code Quality
  • Write and debug code in VS Code with Python and GitHub Copilot extensions.
  • Leverage GitHub Copilot for AI-assisted code suggestions and productivity.
  • Manage source code with GitHub or GitLab.
  • Enforce code quality and standards using SonarQube, ESLint, and Pylint.
 
9. Testing & Quality Assurance
  • Conduct LLM-specific testing using RAGAS and DeepEval for LLM/RAG pipeline evaluation.
  • Use LangSmith Evaluators for prompt testing and hallucination detection.
  • Write and execute unit tests using pytest.
  • Ensure output quality and reliability using LangChain Evaluators and custom metrics.
 
10. Deployment & Infrastructure
  • Orchestrate containers at scale with Kubernetes (K8s), and Google GKE.
  • Automate CI/CD pipelines using GitHub Actions or GitLab CI.
  • Support on-premise, cloud (GCP/Vertex AI), and hybrid infrastructure deployments including edge devices for local inference.
 
11. LLM Monitoring & Observability
  • Monitor LLM performance and usage with LangSmith and Weights & Biases.
  • Track and optimize AI infrastructure costs using OpenMeter and custom dashboards.
  • Set up continuous evaluation pipelines to ensure ongoing model quality and reliability.
  • Monitor application and model performance end-to-end with LangSmith observability tools.
 
Required Qualifications:
  • 10–15 years of overall software engineering experience.
  • 5+ years of hands-on experience in Artificial Generative Intelligence, including LLMs, SLMs, RAG, and multi-agent systems.
  • Deep expertise in Google AI ecosystem: Gemini, Vertex AI, Google ADK, Google AI Studio, and Google Workspace integrations.
  • Proficiency in Python (primary) and familiarity with Node.js.
  • Strong background in cloud-native development on GCP.
  • Demonstrated experience with model fine-tuning (LoRA, QLoRA, PEFT) and model evaluation frameworks.
  • Solid understanding of MLOps, CI/CD for AI systems, and production deployment best practices.
  • Experience with multi-agent AI architectures using Semantic Kernel, or LangGraph.
 
Preferred Qualifications:
  • Google Cloud Professional certifications ( Professional ML Engineer, Professional Cloud Architect).
  • Contributions to open-source AI/ML projects.
  • Experience with edge AI deployments and hybrid cloud-edge inference.
  • Familiarity with synthetic data generation pipelines.
  • Prior experience mentoring junior engineers or interns in AI/ML domains.
 

Applicant Notices & Disclaimers
  • For information on benefits, equal opportunity employment, and location-specific applicant notices, click here


At SPECTRAFORCE, we are committed to maintaining a workplace that ensures fair compensation and wage transparency in adherence with all applicable state and local laws. This position's starting pay is: $97.00/hr.