Senior Data Scientist

Straive · Bengaluru, Karnataka, India

Full-time · Senior · Posted 9 days ago

About This Job

Straive

Location: Bengaluru, Karnataka, India

Work Mode: On-site

Industry: Data Infrastructure and Analytics

Job Description

Dear Candidate,

Straive is hiring for Senior Data Scientist, Please refer the JD below for your reference.

Senior Data Scientist :-

Experience: 6+ Years AI/ML Space (2+ Years specifically in Generative AI/LLMs)

Location: Hyderabad, Bangalore

Type: Full-time

About The Role

We are looking for a high-impact Senior Data Scientist with a strong engineering foundation to join our AI team. You will go beyond standard analytics, building and optimizing production-grade Generative AI solutions. You will be responsible for creating RAG systems, integrating multimodal models, and implementing agentic workflows that turn unstructured data into business insights and automated actions.

Responsibilities

Production RAG Development: Build and optimize end-to-end RAG systems, implementing Hybrid Search (dense vector embeddings + BM25) and advanced reranking strategies to maximize retrieval precision.Multimodal AI Workflows: Integrate Vision-Language Models (e.g., GPT-4o, Claude 3.5, Idefics) to analyze visual data, documents, and images for automated insights extraction.Advanced Prompt Engineering: Develop, test, and maintain complex prompt strategies (Chain-of-Thought, ReAct) using OpenAI/Anthropic APIs and the Hugging Face ecosystem.Vector Database Management: Design indexing strategies and manage high-dimensional data at scale within vector stores such as Pinecone, Weaviate, Milvus, or Pgvector.High-Performance Python Engineering: Write scalable, asynchronous Python code (FastAPI/asyncio) to handle high-throughput AI API calls and API-based data processing.Agentic Orchestration & Deployment: Use LangChain or LlamaIndex to orchestrate LLM workflows and deploy containerized applications using Docker and Kubernetes on cloud platforms (AWS/Azure/GCP).Data Intelligence: Perform advanced NLP preprocessing (semantic chunking, metadata filtering) to improve context quality for LLMs.

Technical Requirements

Experience: 6+ years in AI/ML, with 2+ years of hands-on experience building/deploying Generative AI applications.Languages: Expert-level Python programming (asynchronous, OOPs, high-performance).GenAI/LLM Ecosystem: Deep proficiency with OpenAI, Anthropic, LangChain, LlamaIndex, and Hugging Face.RAG & Search: Hands-on experience with vector databases (Pinecone, etc.) and Hybrid Search implementation.Multimodal: Experience with VLMs for document intelligence or image understanding.Cloud & DevOps: Containerization (Docker) and Cloud AI services (AWS/Azure/GCP)

Sign up to apply