|講者：徐瑞興 Simon Hsu, Etu
講題：Impala Benchmark and Tuning Tips
Impala is a query engine which runs on Hadoop. The impala massively parallel processing (MPP) engine makes SQL queries of Hadoop data easily accessible to analysts familiar with SQL and to business intelligence tools users. Impala focus on improving query performance while retaining a familiar user experience. With Impala, you can query data, whether stored in HDFS or Apache HBase (Including SELECT, JOIN, and aggregate functions – in real time. ) Today, I will illustrate some tips for improving impala performance, or some notes you need to notice while running this powerful tool. Besides, I will show you the procedure of running benchmarks in Impala Let's take a look and check it out!
研究所時期開始接觸Hadoop，碩士論文以Hadoop系統流程架構面之改善，發表論文於IEEE BigData 2014 : "A Transparent Approach to Run MapReduce Programs on Collaborative Hadoops" 曾於鴻海-中央資訊總處研發部門-負責集團Hadoop產品維運/開發 現於Etu(知意圖)擔任資深工程師，從事Hadoop相關解決方案/產品研發
- HadoopCon 2014 年會籌備現況 @ 2014-08-26
- Real-Time Streaming Data Computing for long-term undersea surveillance on top of Storm