Sr AI Engineer

Paramus, NJ

Date Posted:20-May-2026

Work Type:On-Site

Job Number:485683

Job Description

Job Title: Senior AI Engineer – Google AI & Generative Intelligence
Duration: 6 Months (Temp-to-Hire)
Location: Paramus, NJ [Hybrid]

Role Overview
We are seeking a highly experienced Senior AI Engineer with deep expertise in Google AI technologies, Generative AI. The ideal candidate brings 10–15 years of broad software engineering experience, with the last 4+ years focused exclusively on Artificial Generative Intelligence, including designing, building, deploying, and monitoring production-grade AI systems. This role demands mastery of the Google ecosystem — including Google Workspace, Google Agent Development Kit (ADK), and Vertex AI — alongside a strong command of modern LLM/SLM frameworks, cloud-native infrastructure, and MLOps best practices.

Key Responsibilities
1. Large & Small Language Model Engineering

Design, develop, and deploy Agents leveraging commercial LLMs such as Gemini (Google), GPT (OpenAI), and Claude Sonnet (Anthropic) for high-performance, large-context, and multimodal tasks.
Work with open-source/self-hosted LLMs including Mixtral (Mistral AI).
Architect and implement SLM-based solutions using lightweight models such as Phi-3 (Microsoft), Gemma (Google), and Mistral for resource-constrained environments.
Lead fine-tuning and customization of models using Vertex AI Tuning, Hugging Face Transformers, and parameter-efficient fine-tuning (PEFT) methods including LoRA and QLoRA.
Apply training frameworks such as PyTorch, TensorFlow, or JAX for model experimentation and development.
Generate synthetic data and evaluate models using HELM, lm-evaluation-harness, and custom benchmarks.

2. Google AI & Workspace Integration

Lead the design and implementation of AI-powered solutions deeply integrated with Google Workspace (Docs, Sheets, Drive, Gmail, Meet), Big Query and Lakehouse.
Architect and build intelligent agents and workflows using Google Agent Development Kit (ADK).
Leverage Google AI Studio as the primary IDE, VSCode for AI application development and prototyping.
Utilize Google Cloud Platform (GCP) services including:
Vertex AI for ML model training, tuning, and deployment
GKE (Google Kubernetes Engine) for container orchestration
Cloud Run for serverless deployment
Cloud Functions for event-driven AI tasks
Vertex AI Vector DBs for semantic search and retrieval

3. Design & Planning

Lead requirements gathering using Confluence for documentation and team collaboration.
Create detailed system architecture diagrams and AI workflows using Lucidchart.
Design UI/UX prototypes in Figma for AI-powered application interfaces.
Manage project delivery and sprint planning using Jira.
Oversee data preparation and management: cleaning, transforming, and organizing data for AI/ML workflows.
Conduct data analysis using Jupyter Notebooks and pandas for exploration and preprocessing.
Leverage Hugging Face Model Hub for model comparison, selection, and download.

4. Development Frameworks & Tools

Orchestrate LLM/SLM applications using LangChain, LlamaIndex, and LangGraph.
Build multi-agent systems with Semantic Kernel, and LangGraph.
Manage and optimize prompts using LangSmith and PromptLayer.
Deploy models locally with Ollama or at scale with vLLM for efficient inference.
Track experiments, metrics, and results with MLflow or Weights & Biases.
Manage code and data versioning with Git.

5. Vector Databases & Semantic Search

Implement semantic search and Retrieval-Augmented Generation (RAG) pipelines using Vertex AI Vector DBs and ChromaDB.
Design and optimize end-to-end RAG architectures for enterprise-grade knowledge retrieval.

6. Backend Development

Develop robust RESTful APIs using FastAPI (Python) or Express.js (Node.js).
Manage and secure APIs using Mulesoft, Apigee.

7. Frontend Development

Build modern user interfaces using React or Angular.
Utilize Material-UI for consistent, accessible, and modern UI components.
Prototype and plan UI/UX workflows using Figma.

8. Development Tools & Code Quality

Write and debug code in VS Code with Python and GitHub Copilot extensions.
Leverage GitHub Copilot for AI-assisted code suggestions and productivity.
Manage source code with GitHub or GitLab.
Enforce code quality and standards using SonarQube, ESLint, and Pylint.

9. Testing & Quality Assurance

Conduct LLM-specific testing using RAGAS and DeepEval for LLM/RAG pipeline evaluation.
Use LangSmith Evaluators for prompt testing and hallucination detection.
Write and execute unit tests using pytest.
Ensure output quality and reliability using LangChain Evaluators and custom metrics.

10. Deployment & Infrastructure

Orchestrate containers at scale with Kubernetes (K8s), and Google GKE.
Automate CI/CD pipelines using GitHub Actions or GitLab CI.
Support on-premise, cloud (GCP/Vertex AI), and hybrid infrastructure deployments including edge devices for local inference.

11. LLM Monitoring & Observability

Monitor LLM performance and usage with LangSmith and Weights & Biases.
Track and optimize AI infrastructure costs using OpenMeter and custom dashboards.
Set up continuous evaluation pipelines to ensure ongoing model quality and reliability.
Monitor application and model performance end-to-end with LangSmith observability tools.

Required Qualifications:

10–15 years of overall software engineering experience.
5+ years of hands-on experience in Artificial Generative Intelligence, including LLMs, SLMs, RAG, and multi-agent systems.
Deep expertise in Google AI ecosystem: Gemini, Vertex AI, Google ADK, Google AI Studio, and Google Workspace integrations.
Proficiency in Python (primary) and familiarity with Node.js.
Strong background in cloud-native development on GCP.
Demonstrated experience with model fine-tuning (LoRA, QLoRA, PEFT) and model evaluation frameworks.
Solid understanding of MLOps, CI/CD for AI systems, and production deployment best practices.
Experience with multi-agent AI architectures using Semantic Kernel, or LangGraph.

Preferred Qualifications:

Google Cloud Professional certifications ( Professional ML Engineer, Professional Cloud Architect).
Contributions to open-source AI/ML projects.
Experience with edge AI deployments and hybrid cloud-edge inference.
Familiarity with synthetic data generation pipelines.
Prior experience mentoring junior engineers or interns in AI/ML domains.

Applicant Notices & Disclaimers

For information on benefits, equal opportunity employment, and location-specific applicant notices, click here

At SPECTRAFORCE, we are committed to maintaining a workplace that ensures fair compensation and wage transparency in adherence with all applicable state and local laws. This position's starting pay is: $97.00/hr.

inShare on LinkedIn ✉Share via Email

Sr AI Engineer

Job Description

Services

Information

Resources

Social

Privacy and Policy

Privacy and Policy