Imhotep - Data Analytics Platform by Indeed

Introduction

Imhotep is a large-scale analytics platform built by Indeed. This platform enables you to perform fast, interactive, ad hoc queries and aggregate results for large datasets.

Additionally, you can combine results from multiple time-series datasets and build your own data tools for analysis, monitoring, reporting, and automated data processing.

Page Contents

Created by gh-md-toc

Tech Talk Resources

The following Indeed Tech Talks describe Imhotep in more detail:

Tech Talk Description
Interactive Analytics with Imhotep Provides an overview of Imhotep and how you can use it to get the most out of your data.
Machine Learning at Indeed: Scaling Decision Trees Describes how Indeed developed Imhotep, a distributed system for building decision trees for machine learning.
Imhotep: Large-Scale Analytics and Machine Learning at Indeed Describes Imhotep’s primitive operations that allow us to build decision trees, drill into data, build graphs, and even execute SQL-like queries in IQL (Imhotep Query Language). The talk also explains what makes Imhotep fast, highly available, and fault tolerant.
Large-Scale Interactive Analytics with Imhotep Demonstrates how our engineering and product organizations use Imhotep to focus on key metrics at scale.


How Indeed Uses Imhotep

At Indeed, we use Imhotep to answer the following and many more questions about how people around the world are using our job search engine:

  • How many unique job queries were performed on a specific day in a specific country?
  • What are the top 50 queries in a specific country? How many times did job seekers click on a search result for each of those queries?
  • Which job titles have the highest click-through rate for the query Architecture in the US?
  • Which Architecture in the US queries have the lowest click-through rate for job titles?

Getting Started

See the quick start page for instructions.

Discussion

Ask and answer questions in our Q&A forum for Imhotep: indeedeng-imhotep-users

See Also for Resources

Apache Hadoop with Pig
Druid
OpenDremel