Entries by Jethro

On-Demand Webinar: How Tata Communications Uses Jethro to Deliver Interactive BI on Hadoop

Learn how Tata Communications CDN overcame Hadoop latency pitfalls to deliver its end users a BOI dashboard on a shared Hadoop data lake with thousands of concurrent users that works at interactive speed. This on-demand webinar is hosted by Tata Communications CDN Head of Engineering, Samik Mukherjee and Jethro CEO Eli Singer. You will learn about […]

5 Reasons Business Intelligence on Hadoop Projects Fail

Business Intelligence (BI) on Hadoop Projects Are Difficult to Execute Successfully Hadoop has its own set of constraints and problems, and when you throw BI into the mix it’s a whole other ballgame. As companies move their EDW to Hadoop, they expect to be able to migrate their analytical applications as well. While workloads such as […]

Eli Singer Discusses How to Make Business Intelligence Work on Hadoop

Jethro CEO Eli Singer discusses the transition from traditional EDWs to Hadoop and the resulting latency issues. He discusses why this is problematic for Business Intelligence use cases and the ideal SQL-on-Hadoop architecture to solve this problem. Eli tells how to enable Tableau, Qlik and MicroStrategy to deliver  business users an interactive business intelligence on […]

Database Architectures for Business Intelligence Demystified: A Concise Comparison

Database Architectures for Business Intelligence Demystified: A Concise Comparison Introduction The year 2008 was one of drastic change in the world economy and in society. Major financial institutions collapsed, while Internet social networking service providers such as Facebook and Twitter (and later Uber and WhatsApp) grew exponentially. This resulted in enormous changes in the database […]

Jethro 1.6.0 Released

The main themes of Jethro 1.6.0 are concurrency and new range-index features. Concurrency features are: Reuse of results when the same “where” clause is used by multiple queries, in order to reduce resource consumption and increase concurrency. Enhanced locking infrastructure to protect against deadlocks during high load. Increase the maximum number of threads allocated by the operating […]

What’s new in JethroData 1.0

As we announced earlier today, JethroData 1.0 was just released. Since launching the public beta six months ago, we have added numerous improvements and bug fixes across the board. I would like to share some highlights: Performance Adaptive Cache – Users typically access Jethro from their dashboards and BI tools. These tools generate SQLs in […]

Partitioning in Hive and Impala Versus Jethro

In my previous post, I explained how partitioning works in Jethro. In this post, I would like to explain how partitioning was implemented in Hive and Impala, why their design is very problematic, and how our implementation avoids those problems. Design matters! Partitioning in Hive / Impala In Hive and Impala, a partition is a […]

Simple, Automatic Range Partitioning in JethroData

This post will introduce how the partitioning feature is implemented in Jethro. In a nutshell, we added  a simple, automatic range partitioning mechanism that is very easy to work with. Why Use Partitioning? Generally, there are two reasons to partition a large table: Ease of maintenance – partitioning allows implementing a data retention policy and enables efficient […]

Connecting To Jethro from Tableau

Jethro allows fast interactive queries over big data, by indexing all your data. Tableau is a popular BI tool that can be used with Jethro. Tableau runs on Windows 64-bit servers and connects to databases using ODBC. In order to connect to Jethro from Tableau, all you need to do is install our ODBC driver, […]