Data Scientist ( LLM MLOps ) $150K - $175K San Francisco CA
3 Days Old
4 days ago Be among the first 25 applicants
Get AI-powered advice on this job and more exclusive features.
Direct message the job poster from GroRapid Labs
"Helping Web3 Startups Hire Top Golang & Rust Engineers | Tech Recruiter (US | Europe) About the job
Data Scientist – LLMs, Python, MLOps
Remote | Full-Time
A 2024-founded startup based in San Francisco is building structured data tools to improve the accuracy and reliability of large language models. Their platform powers agentic, RAG-native systems through modular knowledge graphs and developer-friendly APIs, turning unstructured data into useful, trusted knowledge.
What you will do
Turn raw JSON, CSV, or HTML into clean insights. Profile, visualize, and identify patterns or outliers—before anyone asks.
Train and tune models for classification, ranking, and RAG with LLMs to move recall and precision metrics forward every week.
API Integrator
Wrap models using FastAPI, validate inputs with Pydantic, and deploy clean, testable endpoints using CI pipelines.
MLOps Wrangler
Monitor data and model drift, run batch jobs, add simple tests, and ensure long-term system reliability.
Insight Storyteller
Communicate findings through Jupyter notebooks, dashboards, and Loom videos. Make insights clear and accessible to legal and non-technical stakeholders.
Startup Swiss-Army Knife
Take initiative to fix data issues, infra gaps, and edge cases—without waiting for formal tasks or assignments.
You might be a fit if you have
3–5 years of experience with Python and tools like pandas, Polars, PyTorch, or TensorFlow
Experience building and deploying APIs with FastAPI and Pydantic
Practical use of LLMs for data augmentation or cleaning tasks
Proficient in SQL, Postgres/DuckDB, and object storage like S3
Familiarity with CI/CD pipelines (e.g., GitHub Actions)
You document clearly and share proactively
Bonus if you have
Experience with web scraping using Scrapy or Playwright, or working with PACER, NHTSA, or FDA datasets
Familiarity with vector databases like Qdrant or pgvector, and prompt engineering
Exposure to regulated environments like SOC 2, HIPAA, etc.
Why this role
You’ll work at the core of production-grade AI systems—from structured LLM pipelines to real-time API deployment. Perfect for someone who thrives in fast-moving, high-ownership environments and wants to build meaningful, technical systems that make LLMs safer and more reliable.
Seniority level Seniority level Mid-Senior level
Employment type Employment type Full-time
Job function Job function Engineering and Information Technology
Industries Software Development
Referrals increase your chances of interviewing at GroRapid Labs by 2x
Sign in to set job alerts for “Data Scientist” roles. San Francisco, CA $172,000.00-$203,000.00 3 weeks ago
AI Training for Data Science (Freelance, Remote) San Francisco, CA $140,000.00-$195,000.00 3 weeks ago
South San Francisco, CA $120,000.00-$135,000.00 3 weeks ago
Research Scientist (Multi-agent Systems) San Francisco, CA $180,000.00-$220,000.00 3 days ago
Brisbane, CA $161,000.00-$185,000.00 3 days ago
Software Engineer, Python - AI Training (Freelance, Remote) Machine Learning Scientist (Staff / Sr Staff) - Power Markets Internship - Research Scientist (AI Agents) San Francisco, CA $157,500.00-$233,400.00 1 day ago
San Francisco, CA $140,000.00-$200,000.00 2 weeks ago
Machine Learning Engineer, Core Engineering Scientist II, Real World Data Science - Translational Research We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
#J-18808-Ljbffr
- Location:
- San Francisco, CA, United States
- Salary:
- $250,000 +
- Category:
- IT & Technology