All roles

AI Engineer (RAG Specialist)

Remote · USA Full-time New today

AI Engineer (RAG Specialist) We are looking for a skilled AI Engineer specializing in Retrieval-Augmented Generation (RAG) to join our team. Your primary focus will be bridging the gap between static LLMs and dynamic, proprietary data. You won't just be "calling an API"; you will be architecting the entire data lifecycle-from ingestion and chunking strategies to advanced retrieval and response synthesis. The ideal candidate understands that the secret to a great RAG system isn't just the LLM, but the quality of the retrieval and the nuances of the vector database. US Citizenship Required

Key Responsibilities

  • Pipeline Architecture: Design and deploy end-to-end RAG pipelines using frameworks like LangChain, LlamaIndex, or Haystack.
  • Data Engineering: Develop robust ETL processes to ingest unstructured data (PDFs, docs, web scrapes) into high-performance vector stores.
  • Retrieval Optimization: Implement and tune advanced retrieval techniques, including Hybrid Search (keyword + semantic), Re-ranking (Cross-Encoders), and Parent-Document Retrieval.
  • Vector Database Management: Manage and scale vector databases such as Pinecone, Weaviate, Milvus, or Chroma.
  • Evaluation & Benchmarking: Establish rigorous evaluation frameworks (e.g., RAGAS, TruLens) to measure faithfulness, relevancy, and hit rates.
  • Performance Tuning: Optimize embedding models and prompt engineering to reduce latency and "hallucinations."

Technical Qualifications

  • Language Proficiency: Advanced Python (preferred) or TypeScript.
  • LLM Expertise: Hands-on experience with OpenAI GPT-4, Anthropic Claude, or open-source models like Llama 3 via Ollama or vLLM.
  • Vector Expertise: Deep understanding of embeddings, similarity metrics (Cosine, Euclidean), and indexing strategies (HNSW, IVF).
  • NLP Fundamentals: Familiarity with tokenization, context windows, and attention mechanisms.
  • Cloud/DevOps: Experience deploying AI applications on AWS, GCP, or Azure using Docker/Kubernetes.

Preferred Skills

  • Experience with Agentic RAG (Multi-step reasoning and tool-use).
  • Knowledge of Graph Databases (Neo4j) for GraphRAG implementations.
  • Contributions to open-source AI projects.
  • Background in traditional Information Retrieval (Elasticsearch/Solr).

Apply tot his job Apply To this Job

Related roles

Founding AI Engineer — Business Automation

Remote · USA Full-time

Software Engineer (Trajectory) (Train AI Models Part Time!)

Remote · USA Full-time

Artificial Intelligence / Machine Learning Engineer

Remote · USA Full-time

Senior Machine Learning Engineer, Dash Agentic AI

Remote · USA Full-time

Senior Data Science Engineer / Senior Machine Learning Engineer

Remote · USA Full-time

Software Engineer, Machine Learning

Remote · USA Full-time

Sr. Machine Learning Engineer (Recommendation Systems)

Remote · USA Full-time

Senior Machine Learning Engineer - Edge AI

Remote · USA Full-time

Senior Staff Machine Learning Engineer, ML Understanding

Remote · USA Full-time

Machine Learning Engineer III

Remote · USA Full-time

Experienced Full Stack Customer Service Representative – Remote Call Center

Remote · USA Full-time

Weekends | Primary Care Physician (FM/IM) | NC | Remote

Remote · USA Full-time

Experienced Customer Care Executive – Driving Excellent Customer Experiences in Electric Car Leasing

Remote · USA Full-time

IT Auditor – FISMA/FedRAMP

Remote · USA Full-time

Experienced Customer Service Representative – Remote Customer Support for arenaflex in Florida

Remote · USA Full-time

Steuerfachkraft (m/w/d) in Torgelow mindestens 52.000€ - 100% Remote möglich

Remote · USA Full-time

Senior Civil Engineer- Federal Engineering and Design

Remote · USA Full-time

Experienced Customer Service Advisor – Work From Home Opportunity with arenaflex

Remote · USA Full-time

Experienced Virtual Customer Support Associate – Delivering Exceptional Customer Experiences at arenaflex

Remote · USA Full-time

Experienced Remote Data Entry Specialist for Teens - arenaflex Work from Home Opportunity

Remote · USA Full-time