97在线视频人妻无码,av人摸人人人澡人人超碰,国产色无码专区在线观看,av大帝狠狠久久,中文字字幕在线中文乱码品,亚洲无码自拍视频,国自产精品手机在线视拍,无码免费中文字幕a级毛片,国产一级牲交高潮片16,国产精品久久免费看,欧美l日韩国产一级视频,国产精品亚洲产品一区二区三区,国自产精品手机在线视频香蕉,人妻少妇中文字幕久久18,Av免费无码一区二区……,亚洲色图无码高免费,久久精品视屏综合,亚洲真人无码一二三区

學(xué)術(shù)看板

當(dāng)前位置：首頁 > 學(xué)術(shù)交流 > 學(xué)術(shù)看板 > 正文

How big data enhance AI

日期：2018-05-15 來源：本站作者：關(guān)注：次

時(shí)間：2018年5月16日上午9:30

地點(diǎn)：望江校區(qū)東三教503會(huì)議室

報(bào)告人：唐明潔

報(bào)告人簡介：2007年永利yl23411官網(wǎng)計(jì)算機(jī)本科畢業(yè)，2010年從中國科學(xué)院研究生院取得計(jì)算機(jī)碩士學(xué)位，2013年從美國普渡大學(xué)獲得計(jì)算機(jī)碩士學(xué)位，2016年從美國普渡大學(xué)取得計(jì)算機(jī)博士學(xué)位。曾就職于美國微軟，IBM研究院?，F(xiàn)就職于大數(shù)據(jù)公司Hortonworks做研究科學(xué)家,主要從事Spark和TensorFlow的研究和開發(fā)。博士期間在包括VLDB, TKDE, ICDE, EDBT, SIGSPATILA, IEEEIntelj在內(nèi)的會(huì)議雜志發(fā)篇論文20余篇，曾獲得數(shù)據(jù)庫會(huì)議SISAP201最佳論文，數(shù)據(jù)挖掘會(huì)議ADMA2009最佳應(yīng)用論文，部分研究成果已經(jīng)被開源社區(qū)PostgreSQL和Spark所采用。

學(xué)術(shù)報(bào)告摘要：TensorFlow and XGBoost are state-of-the-art platform for Deep learning and Machine learning. However, either of them are suit for big data processing in real production environment. For example, TensorFlow fail to provide OLAP or ETL over big data, thus, it impedes TensorFlow to train a deep learning model with clean and enough data in more efficient way. Similarly, despite better performance compared with other gradient-boosting implementations, it’s still a time-consuming task to train XGBoost model when the data is big. And it usually requires extensive parameter tuning to get a highly accurate model, which brings the strong requirement to speed up the whole process.

In this talk, we will mainly introduce how Spark to improve TensorFlow and XGBoost in the real application, and demonstrate how these platforms could be benefit from big data techniques. More specifically, we at first introduce how Spark ML come to support auto parameter tuning, and apply transfer learning to enhance the real application like recommendation system and image searching. Secondly, we cover the implementation and performance improvement of GPU-based XGBoost algorithm, summarize model tuning experience and best practice, share the insights on how to build a heterogeneous data analytic and machine learning pipeline based on Spark in a GPU-equipped YARN cluster, and show how to push model into production.

【關(guān)閉】

国产精品久久久一级毛片_亚洲a级大片免费看_亚洲aⅴ在线无码播放_亚洲 欧美 制服

国产精品久久久一级毛片_亚洲a级大片免费看_亚洲aⅴ在线无码播放_亚洲欧美制服