Research Scientist – Tabular & Structured Machine Learning

Granica

4 days ago

Full-time

On-site

San Francisco Bay Area, California, United States

$160 - $250 USD yearly

Science & Research

Research Scientist – Tabular & Structured Machine Learning

The Mission

AI today is limited not only by model design but by the inefficiency of the data that feeds it. At scale, each redundant byte, poorly organized dataset, and inefficient data path slows progress and compounds into enormous cost, latency, and energy waste.

Granica’s mission is to remove that inefficiency. We combine advances in information theory, probabilistic modeling, and distributed systems to design self-optimizing data infrastructure: systems that continuously improve how information is represented, compressed, and used by AI.

Granica’s research group is led by Prof. Andrea Montanari (Stanford), bridging advances in learning theory and information efficiency with large-scale distributed systems. Together, we share a conviction that the next leap in AI will come not only from larger models, but from more efficient learning systems and better data representations.

Most modern AI research focuses on text, images, or video. Granica’s work focuses on the far less explored but economically critical domain of large-scale structured and tabular data, which powers the majority of enterprise decision-making systems.

Granica is pioneering a new class of structured AI models: foundational models built to learn and reason from relational, tabular, and structured data. While others focus on unstructured text or media, we are exploring the next frontier: systems that understand and reason over the structured information that runs the global economy.

This role focuses specifically on machine learning for structured and tabular data rather than general LLM application development.

What You’ll Build and Research

Invent and prototype algorithms that advance the foundations of machine learning for structured and tabular data
Develop new representation learning techniques and information models for large enterprise datasets
Build adaptive learners combining statistical learning theory, probabilistic modeling, and large-scale systems optimization
Contribute to the development of large tabular models and structured foundation models
Design architectures integrating relational, symbolic, and neural learning components
Research and implement methods for dataset compression, selection, and representation to improve learning efficiency
Develop cost models and optimization frameworks for large-scale structured learning systems
Collaborate closely with the Granica research group led by Prof. Andrea Montanari (Stanford) and with systems engineers
Rapidly prototype new algorithms and evaluate them on real enterprise datasets
Publish and contribute to the broader research community shaping the future of structured AI and efficient ML systems

What You’ll Bring

PhD in Machine Learning, Statistics, Computer Science, Applied Mathematics, or a related field
Research experience related to structured, relational, or tabular data
Experience in one or more of the following areas:
- Tabular or relational machine learning
- Representation learning for structured data
- Statistical learning theory or generalization
- Probabilistic modeling or Bayesian inference
- Optimization for machine learning
- Scalable or distributed ML systems
Experience working with structured datasets or relational data systems
Strong grounding in statistics, optimization, information theory, or probabilistic inference
Hands-on experience with PyTorch, JAX, or TensorFlow
Strong programming skills in Python or Rust
Demonstrated ability to translate theoretical ideas into working systems or prototypes
Curiosity about how structure and relational information enable new forms of learning and reasoning
A pragmatic research mindset: you value elegant ideas but also ship systems that work at scale

Bonus

Research in tabular machine learning, relational representation learning, or structured data modeling
Experience building large-scale ML infrastructure or distributed training systems
Familiarity with data systems, query engines, or dataset optimization pipelines
Publications at top venues such as NeurIPS, ICML, ICLR, COLT, KDD, AAAI
Contributions to open-source ML systems or research-to-production tooling

Compensation & Benefits

Competitive salary, meaningful equity, and substantial bonus for top performers
Flexible time off plus comprehensive health coverage for you and your family
Support for research, publication, and deep technical exploration

At Granica, you will shape the fundamental infrastructure that makes intelligence itself efficient, structured, and enduring. Join us to build the foundational data systems that power the future of enterprise AI!

Apply now

Research Scientist – Tabular & Structured Machine Learning

Research Scientist – Tabular & Structured Machine Learning

The Mission

What You’ll Build and Research

What You’ll Bring

Bonus

Compensation & Benefits

More jobs

Chemist

Vision Administration

Associate Course Director, Biochemistry

Touro University