Granica logo

Research Scientist – Tabular & Structured Machine Learning

Granica
4 days ago
Full-time
On-site
San Francisco Bay Area, California, United States
$160 - $250 USD yearly
Science & Research

Research Scientist – Tabular & Structured Machine Learning

The Mission

AI today is limited not only by model design but by the inefficiency of the data that feeds it. At scale, each redundant byte, poorly organized dataset, and inefficient data path slows progress and compounds into enormous cost, latency, and energy waste.

Granica’s mission is to remove that inefficiency. We combine advances in information theory, probabilistic modeling, and distributed systems to design self-optimizing data infrastructure: systems that continuously improve how information is represented, compressed, and used by AI.

Granica’s research group is led by Prof. Andrea Montanari (Stanford), bridging advances in learning theory and information efficiency with large-scale distributed systems. Together, we share a conviction that the next leap in AI will come not only from larger models, but from more efficient learning systems and better data representations.

Most modern AI research focuses on text, images, or video. Granica’s work focuses on the far less explored but economically critical domain of large-scale structured and tabular data, which powers the majority of enterprise decision-making systems.

Granica is pioneering a new class of structured AI models: foundational models built to learn and reason from relational, tabular, and structured data. While others focus on unstructured text or media, we are exploring the next frontier: systems that understand and reason over the structured information that runs the global economy.

This role focuses specifically on machine learning for structured and tabular data rather than general LLM application development.

What You’ll Build and Research

  • Invent and prototype algorithms that advance the foundations of machine learning for structured and tabular data

  • Develop new representation learning techniques and information models for large enterprise datasets

  • Build adaptive learners combining statistical learning theory, probabilistic modeling, and large-scale systems optimization

  • Contribute to the development of large tabular models and structured foundation models

  • Design architectures integrating relational, symbolic, and neural learning components

  • Research and implement methods for dataset compression, selection, and representation to improve learning efficiency

  • Develop cost models and optimization frameworks for large-scale structured learning systems

  • Collaborate closely with the Granica research group led by Prof. Andrea Montanari (Stanford) and with systems engineers

  • Rapidly prototype new algorithms and evaluate them on real enterprise datasets

  • Publish and contribute to the broader research community shaping the future of structured AI and efficient ML systems

What You’ll Bring

  • PhD in Machine Learning, Statistics, Computer Science, Applied Mathematics, or a related field

  • Research experience related to structured, relational, or tabular data

  • Experience in one or more of the following areas:

    • Tabular or relational machine learning

    • Representation learning for structured data

    • Statistical learning theory or generalization

    • Probabilistic modeling or Bayesian inference

    • Optimization for machine learning

    • Scalable or distributed ML systems

  • Experience working with structured datasets or relational data systems

  • Strong grounding in statistics, optimization, information theory, or probabilistic inference

  • Hands-on experience with PyTorch, JAX, or TensorFlow

  • Strong programming skills in Python or Rust

  • Demonstrated ability to translate theoretical ideas into working systems or prototypes

  • Curiosity about how structure and relational information enable new forms of learning and reasoning

  • A pragmatic research mindset: you value elegant ideas but also ship systems that work at scale

Bonus

  • Research in tabular machine learning, relational representation learning, or structured data modeling

  • Experience building large-scale ML infrastructure or distributed training systems

  • Familiarity with data systems, query engines, or dataset optimization pipelines

  • Publications at top venues such as NeurIPS, ICML, ICLR, COLT, KDD, AAAI

  • Contributions to open-source ML systems or research-to-production tooling

Compensation & Benefits

  • Competitive salary, meaningful equity, and substantial bonus for top performers

  • Flexible time off plus comprehensive health coverage for you and your family

  • Support for research, publication, and deep technical exploration

At Granica, you will shape the fundamental infrastructure that makes intelligence itself efficient, structured, and enduring. Join us to build the foundational data systems that power the future of enterprise AI!