Impala Benchmark and Tuning Tips

Speaker: 徐瑞興 Simon Hsu, Etu
Time: 15:30~16:20
Location: 3F, Conference Room 1
Title: Impala Benchmark and Tuning Tips

Abstract:

Impala is a query engine that runs on Hadoop. Its massively parallel processing (MPP) engine makes Hadoop data easily accessible through SQL queries to analysts familiar with SQL and to users of business intelligence tools.

Impala focuses on improving query performance while retaining a familiar user experience.
With Impala, you can query data stored in HDFS or Apache HBase in real time, using SELECT, JOIN, and aggregate functions.
Today, I will share some tips for improving Impala performance, along with a few things to watch out for when running this powerful tool.
I will also walk through the procedure for running benchmarks in Impala.

Let's take a look and check it out!

Speaker bio:

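Started working with Hadoop in graduate school. His master's thesis addressed improvements to the Hadoop system's workflow architecture, and the resulting paper was published at IEEE BigData 2014: "A Transparent Approach to Run MapReduce Programs on Collaborative Hadoops"
Previously worked in the R&D department of the Central Information Division at Foxconn (鴻海), responsible for operating and developing the group's Hadoop products
Currently a senior engineer at Etu (知意圖), working on R&D for Hadoop-related solutions and products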