O’REILLY、INTEL AI主办

English中文
将人工智能用起来
2019年6月18-21日
北京,中国

人工智能大会2019讲师

新讲师会不断加入,请方便的时候回来查看最近更新。

过滤器

搜索讲师

Sarah Aerni is a Director of Data Science at Salesforce Einstein, where she leads teams building AI-powered applications using autoML. Prior to Salesforce she led the healthcare & life science and Federal teams at Pivotal. Sarah obtained her PhD from Stanford University in Biomedical Informatics, performing research at the interface of biomedicine and machine learning. She also co-founded a company offering expert services in informatics to both academia and industry.

Presentations

Achieving Salesforce-Scale Machine Learning in Production 40分钟议题 (40-minute session)

At Salesforce Einstein data science is an agile partner to over 100,000 customers. How do we achieve this scale? We share lessons learned in business, technology and process along the way. Via use cases, oft-missed foundational elements for deployment, and the evaluations that must happen along the way, we will share how to achieve and sustain models in production, and where to go from there.

Vijay Srinivas Agneeswaran is a senior director of technology at Publicis Sapient. Vijay has spent the last 12 years creating intellectual property and building products in the big data area at Oracle, Cognizant, and Impetus, including building PMML support into Spark/Storm and implementing several machine learning algorithms, such as LDA and random forests, over Spark. He also led a team that build a big data governance product for role-based, fine-grained access control inside of Hadoop YARN and built the first distributed deep learning framework on Spark. Earlier in his career, Vijay was a postdoctoral research fellow at the LSIR Labs within the Swiss Federal Institute of Technology, Lausanne (EPFL). He is a senior member of the IEEE and a professional member of the ACM. He holds four full US patents and has published in leading journals and conferences, including IEEE Transactions. His research interests include distributed systems, cloud, grid, peer-to-peer computing, machine learning for big data, and other emerging technologies. Vijay holds a bachelor’s degree in computer science and engineering from SVCE, Madras University, an MS (by research) from IIT Madras, and a PhD from IIT Madras.

Presentations

Industrialized Capsule Networks for Text Analytics 40分钟议题 (40-minute session)

We illustrate how capsule networks can be industrialized: 1. Overview of capsule networks and how they help in handling spatial relationships between objects in an image. We also learn about how they can be applied to text analytics. 2. We show an implementation of recurrent capsule networks, which are useful in text analytics, especially for some tasks such as summarization or classification.

Jesse Anderson is a Big Data Engineering expert and trainer.

Presentations

Professional Kafka development 2天培训 (2-day Training)

Jesse Anderson leads a deep dive into Apache Kafka. You'll learn how Kafka works and how to create real-time systems with it. You'll also discover how to create consumers and publishers in Kafka and how to use Kafka Streams, Kafka Connect, and KSQL as you explore the Kafka ecosystem.

Chris Butler is the director of AI at Philosophie, where he leads the firm in human-centered AI engagements. Chris has over 19 years of product and business development experience at companies like Microsoft, KAYAK, and Waze. He was first introduced to AI through graph theory and genetic algorithms while studying computer systems engineering at Boston University and has worked on AI-related projects at his startup Complete Seating (data science and constraint programming), Horizon Ventures (advising portfolio companies like Affectiva), and Philosophie (AI consulting and coaching). He has created techniques like empathy mapping for the machine and confusion mapping to create cross-team alignment while building AI products.

Presentations

Design Thinking for AI 3小时辅导课 (3-hour Tutorial)

Purpose, a well-defined problem, and trust from people are important factors to any system, especially those that employ AI. Chris Butler leads you through exercises that borrow from the principles of design thinking to help you create more impactful solutions and better team alignment.

Yue Cathy Chang is co-founder and CEO at TutumGene, a technology company that aims to accelerate disease curing by providing solutions for gene therapy and regulation of gene expression.

Cathy is a business executive recognized for sales, business development, and product marketing in high technology. She was most recently with Silicon Valley Data Science, a startup that provided business transformation consulting to enterprises and other organizations using data science- and engineering-based solutions. Prior to that, Cathy was employee #1 hired by the CEO at venture-funded software startup Rocana (acquired by Splunk), where she served as Senior Director of Business Development focusing on building and growing long-term relationships, and notably increased sales leads 2x through building and managing indirect revenue channels.

Prior to Rocana, Cathy held multiple strategic roles at blue chip software enterprise companies as well as startups, including Corporate and Business Development at FeedZai and Datameer; Senior product management, product marketing and sales at Symantec and IBM; and Strategic Sourcing Improvement Consulting at Honeywell.

Cathy holds MS and BS degrees in Electrical and Computer Engineering from Carnegie Mellon University, MBA and MS degrees as a Leaders for Global Operations (LGO) duel-degree fellow from MIT, and two patents for her early work in microprocessor logic design.

Presentations

Artificial intelligence meets genomics: accelerating understanding of our genetic make up and use of genome editing to revolutionize medicine 40分钟议题 (40-minute session)

Genome editing has been dubbed as a top technology that could create trillion-dollar markets in the next decade. Recent advancements in the application of AI to genomic editing are accelerating transformation of medicine. We will discuss how AI is applied to genome sequencing, genome editing and their potential to correct mutations, and questions on using genome editing to optimize human health.

Dr. Dongfeng Chen is the engineering director of Clobotics, a global leader in computer vision solutions for the wind power and retail industries. Clobotics’ end-to-end solutions combine computer vision, artificial intelligence/machine learning, and data analytics software with different hardware form factors, including autonomous drones, mobile applications, and other IoT devices to help companies automate time-intensive operational processes, increase efficiencies, and boost the bottom line using real-time, data-driven, and actionable insights. Clobotics is a company with dual-headquarters in Shanghai and Seattle, and has expanded its footprint to Beijing, Dalian, and Singapore. Dr. Dongfeng Chen is currently leading the Clobotics retail research and development team in Shanghai.
Prior to Clobotics, Chen was a senior architect at Baidu. He recruited, led, and build a team of more than 30 members, including developers, testers, and product managers, and created the core algorithms for Baidu advertisement and Baidu Kuaixing (online travel booking site), and Baidu Mall (a flash-sale e-commerce platform). He is an expert in machine learning and distributed systems. Through the innovative use of knowledge graph, Chen’s team developed an effective way that associates Baidu search pool with paid advertisements, this in turn brought in more than tens of millions USD in revenue, and this technology is still being used until today.
Prior to Baidu, Chen was a seasoned serial entrepreneur. In 2010, Chen’s team developed China’s first “Groupon” website for the travel industry, it was the first travel website that offers group bundle deals. Statistics shows the website accounts for 30 – 50% of the market share of short- distance travel near Shanghai. This experience gave Chen a great foundation to build Baidu Kuaixing, Baidu online travel booking site, later on.
Dr. Chen received his Ph.D. in Computer Science from North Carolina State University. His Ph.D. thesis topic was on using structured views to optimize query in information integration.

陈东锋 博士 扩博智能高级研发总监。加入扩博智能之前,陈东锋博士曾担任百度高级架构师。任职期间,陈东锋博士管理和带领研发团 队专注于百度电商广告和百度快行业务的软件研发、测试和产品管理。并为百度快行、特卖频道和电 商知心等等项目开发了核心算法。陈东锋博士的工作成果通过知识图谱技术的创新运用把百度搜索池 与电商广告强相关,为百度在电商特卖领域带来从 0 到上亿人民币的持续巨额营收,该核心技术沿用 至今。陈东锋博士带领团队开发的百度快行项目的成功,使得百度汽车票和火车票交易业务在两年内 成为百度 O2O 垂直行业 GMV第一名,用户体验大大提升。
加入百度之前,陈东锋博士是一名经验丰富的连续创业者。2010 年陈东锋博士带领团队开发了国 内第一个旅游行业团购网站,据数据统计该网站占上海周边短途旅游交易额 30-50%市场份额,这也 为陈东锋博士研发百度出行业务提供了借鉴意义。
陈东锋博士拥有北卡罗来纳州立大学(North Carolina State University)计算机科学博士学位。他 的博士论文主题是使用结构化视图来优化信息集成中的查询。
陈东锋博士是扩博智能智慧零售研发负责人,负责产品研发及交付。扩博智能聚焦计算机视觉和机 器学习技术,专注为行业企业用户提供端到端一体化智能服务,能大力提升传统行业运营效率,加快 数字化变革,所服务的行业包括零售和风电。扩博智能总部位于中国上海和美国西雅图,在北京和大 连设有办事处,新加坡设有分公司。

Presentations

How AI is Revolutionizing the Wind Power Industry 40分钟议题 (40-minute session)

In this talk, we will share the successes and failures of creating an entirely autonomous visual recognition-powered drone inspection solution for turbine blades, which increased the efficiency by 10 times.

Roger Chen is cofounder and CEO of Computable and program chair for the O’Reilly Artificial Intelligence Conference. Previously, he was a principal at O’Reilly AlphaTech Ventures (OATV), where he invested in and worked with early-stage startups primarily in the realm of data, machine learning, and robotics. Roger has a deep and hands-on history with technology. Before startups and venture capital, he was an engineer at Oracle, EMC, and Vicor. He also developed novel nanoscale and quantum optics technology as a PhD researcher at UC Berkeley. Roger holds a BS from Boston University and a PhD from UC Berkeley, both in electrical engineering.

Presentations

Friday opening remarks 主题演讲 (Keynote)

Opening keynote remarks by Program Chairs Ben Lorica, Jason Dai, and Roger Chen

Thursday opening remarks 主题演讲 (Keynote)

Opening keynote remarks by Program Chairs Ben Lorica, Jason Dai, and Roger Chen

Yijing Chen is a senior data scientist in the Cloud AI Group at Microsoft, where she works with external customers in areas such as energy demand forecast, user mobile behavioral analysis, retail demand forecast, energy theft detection, product pricing, and medical claim denial prediction as well as on other projects using various machine learning methods. Yijing holds an MA in statistics from Harvard University.

Presentations

基于深度学习的时间序列预测 (Deep Learning for Time Series Forecasting) 3小时辅导课 (3-hour Tutorial)

Almost every business today uses forecasting to make better decisions and allocate resources more effectively. Deep learning has achieved a lot of success in computer vision, text and speech processing, but has only recently been applied to time series forecasting. In this tutorial we show how and when to apply deep neural networks to time series forecasting. The tutorial will be in CHN and EN.

Chin is the data engineer working at Rakuten who originated and lead the team building the data science platform.

Presentations

Best practice of building data science platform in Rakuten 40分钟议题 (40-minute session)

Data Science Platform is a suite of tools for exploring data, training models, and running GPU/CPU compute jobs in an isolated container environment. It provides one click machine learning environment creation, powerful job scheduler and flexible "function as a service" component. It runs on Kubernetes and supports both on-premises and cloud environment, as well as hybrid mode.

种骥科博士,清华兼职教授,现任美国Acorns首席数据科学家. 种骥科曾任职于宜人贷 (NYSE:YRD) 首席数据科学家,负责反欺诈风控和数字驱动的运营和创新。种博士曾任职于美国Simply Hired招聘平台,创建了数据科学部, 并应邀为白宫科技办公室参谋大数据技术产品设计。还曾就职于美国Silver Lake 私募公司任Kraftwerk基金数据科学架构师,负责大数据技术在私募投资风控方面的应用。种骥科曾任美国卡内基梅隆大学教授与博士生导师,持有加州大学伯克利分校电子工程和计算机科学系博士学位,卡内基梅隆大学电子和计算机工程系硕士及本科学位,和10项美国专利(5项获准,5项待批)。

Presentations

量化互联网金融信用与反欺诈风控 2天培训 (2-day Training)

您想了解金融企业是怎样利用大数据和人工智能技术来画像个人行为并检测欺诈用户的吗?互联网金融幕后的量化分析流程是怎么杨的?个人信用是怎样通过大数据被量化的?在实践过程中,机器学习算法的应用存在着哪些需要关注的方面?怎样通过图谱分析来融合多维数据,为我们区分正常用户和欺诈用户? 这套辅导课基于清华大学交叉信息研究院开设的一门"量化金融信用与风控分析”研究生课。其中会用LendingClub的真实借贷数据做为案例,解说一些具体模型的实现。

崔宏宇,现任DataVisor中国区技术负责人,自2015年起在DataVisor开发使用分布式无监督机器学习算法进行反欺诈检测。负责过如Pinterest、Yelp、阿里巴巴和猎豹移动等大型互联网企业的机器注册、虚假评论、垃圾邮件、欺诈交易和虚假应用安装等场景的反欺诈建模 。在模型调优、特征工程和算法开发等领域都有着丰富的经验。崔宏宇拥有在爱荷华州立大学的博士学位,在博士期间的研究方向为数据分析和结构 – 性能建模等。

Presentations

运用自动化AI技术打击“智能化”网络欺诈 40分钟议题 (40-minute session)

AI技术在赋能各个产业的同时,也被网络黑产所利用,使得黑产攻击更加自动化,更加隐蔽,难于检测。 DataVisor在互联网反欺诈领域研究发现,目前黑产的攻击模型呈现以下趋势:攻击方法多样化而变化快,攻击手段趋于模拟正常用户,攻击账号主要来源由大规模注册渐渐转向ATO账号。传统的规则系统和有监督的模型,由于对欺诈案例以及标签数据的强依赖,往往无法及时应对迅速演化的黑产攻击,在反欺诈中一直处于被动防守的状态。DataVisor的无监督算法,通过全局分析,在高维空间聚类,可以在无标签情况下,自动发现大规模关联欺诈团伙。无监督算法在提前预警以及检测快速演变欺诈模式方面体现了显著的优势。

Jason (Jinquan) Dai is a senior principal engineer and CTO of big data technologies at Intel, where he is responsible for leading the global engineering teams (located in both Silicon Valley and Shanghai) on the development of advanced big data analytics (including distributed machine and deep learning), as well as collaborations with leading research labs (e.g., UC Berkeley AMPLab and RISELab). Jason is an internationally recognized expert on big data, cloud, and distributed machine learning; he is the program cochair of the O’Reilly AI Conference in Beijing, a founding committer and PMC member of Apache Spark, and the creator of BigDL, a distributed deep learning framework on Apache Spark.

Presentations

Friday opening remarks 主题演讲 (Keynote)

Opening keynote remarks by Program Chairs Ben Lorica, Jason Dai, and Roger Chen

Thursday opening remarks 主题演讲 (Keynote)

Opening keynote remarks by Program Chairs Ben Lorica, Jason Dai, and Roger Chen

Bin Fan is a software engineer at Alluxio and a PMC member of the Alluxio project. Previously, Bin worked at Google building next-generation storage infrastructure, where he won Google’s Technical Infrastructure award. He holds a PhD in computer science from Carnegie Mellon University.

Presentations

AVA: a Cloud-Native Deep Learning Platform at Qiniu 40分钟议题 (40-minute session)

Atlab Lab at Qiniu Cloud focuses on deep learning for computer vision. Our team has built a high-performance and cost-effective training platform based on Cloud for deep learning, called AVA, which deeply integrates open source software stack including Tensorflow, Caffe, Alluxio and KODO our own cloud object storage.

Huaxin Gao is a software engineer in IBM Open Source Data and AI group with focus on Apache Spark machine learning and deep learning. She is an active code contributor to Apache Spark project.

Presentations

AI Pipelines on container platform 40分钟议题 (40-minute session)

AI pipelines simplifies the lifecycle workflow management and enhances the reproducibility and collaboration for machine learning/deep learning. A cloud native platform solution is great at portability and scalability. Combining both strengths, AI pipelines on container platform can help accelerate both AI applications development and deployment.

Bas Geerdink is a programmer, scientist, and IT manager at ING, where he is responsible for the fast data systems that process and analyze streaming data. Bas has a background in software development, design, and architecture with broad technical experience from C++ to Prolog to Scala. His academic background is in artificial intelligence and informatics. Bas’s research on reference architectures for big data solutions was published at the IEEE conference ICITST 2013. He occasionally teaches programming courses and is a regular speaker at conferences and informal meetings.

Presentations

AI at ING: the why, how, and what of a data-driven enterprise 40分钟议题 (40-minute session)

AI is at the core of ING’s business. We are a data-driven enterprise, with ‘analytics skills’ as a top strategic priority. We are investing in AI, big data, and analytics to improve business processes such as balance forecasting, fraud detection and customer relation management. In this talk, Bas will give an overview of the use cases and technology to inspire the audience!

Chenhui Hu is a Data Scientist in the Cloud AI organization at Microsoft. His current interests include retail forecast, inventory optimization, IoT data, and deep learning. He received his PhD degree from Harvard University with his PhD thesis focusing on biomedical imaging data mining. He also has research experience in wireless networks and network data analysis. He is a recipient of the third IEEE ComSoc Asia-Pacific Outstanding Paper Award. 

Presentations

Forecasting Customer Activities with Dilated Convolution Neural Networks: Use Case and Best Practices 40分钟议题 (40-minute session)

Forecasting customer activities is one of the most important and common business problems. In Microsoft Azure Identity team, we forecast customer behavior based on billions of user activities. We will share how we improve 25% of forecasting accuracy with dilated convolutional neural networks and reduce 80% of the time in development with the best practices of time series forecasting.

Shengsheng (Shane) Huang is a software architect at Intel and an Apache Spark committer and PMC member, leading the development of large-scale analytical applications and infrastructure on Spark in Intel. Her area of focus is big data and distributed machine learning, especially deep (convolutional) neural networks. Previously at NUS (National University of Singapore), her research interests are large-scale vision data analysis and statistical machine learning.

Presentations

Game playing using AI on Spark 40分钟议题 (40-minute session)

In this presentation we will share experiences from our attempts in using AI on Spark for game playing.

Alex Ingerman leads the product management team at Google Research, focusing on federated learning and other privacy-preserving technologies for machinde learning. He joined Googlew in 2016 after working on products including ML-as-a-service platform for developers, web-scale search, content recommendation system and immersive data-exploration environments. Alex holds a BS in computer science and an MS in medical engineering.

Presentations

The future of machine learning is decentralized 40分钟议题 (40-minute session)

Federated Learning is the approach of training ML models across a fleet of participating devices, without collecting their data in a central location. Alex Ingerman introduces Federated Learning, compares the traditional and federated ML workflows, and explores the current and upcoming use cases for decentralized machine learning, with examples from Google's deployment of this technology.

Jewel James is currently working as a product analyst at Go-jek

Presentations

Using ML for personalizing Food Recommendations 40分钟议题 (40-minute session)

The story of how we prototyped the search framework that personalizes the restaurant search results by using ML to learn what constitutes a relevant restaurant given a user's purchasing history

Michael James is Founder and Chief Architect in Advanced Technologies: Mathematics, Algorithms and Software at Cerebras Systems and a pioneer in geometrically mapped algorithms. Cerebras is a computer hardware company developing deep technologies to scale and accelerate machine learning by orders of magnitude for AI applications.

Previously, Michael was a Fellow at Advanced Micro Devices. Under his leadership, the team designed first-of-its-kind technology based on a self-healing fabric interconnect to allow reliable operation of large computer clusters. Michael joined AMD via the acquisition of SeaMicro Systems, where he was the Chief Architect specializing in real-time workload placement and routing algorithms. Michael’s experience includes the fields of computer-automated language translation; algorithms for gesture recognition; compilers; operating systems and micro-controller design. Outside Cerebras, Michael provides advice and talks to established Silicon Valley companies on a diverse range of AI topics.

Michael’s passion for AI comes from his root in academia: He received bachelor’s degrees from UC Berkeley in the domains of Mathematics, Computer Science and Neurobiology.

Presentations

Keynote by Michael James 主题演讲 (Keynote)

Keynote by Michael James

Yangqing Jia is director of engineering for Facebook’s AI platform team, which develops general-purpose open source AI solutions that serve as the backbone of Facebook AI products, such as ranking, computer vision, natural language processing, speech recognition, mobile AI, and AR. He has been influential in developing an open source deep learning software stack, many of the components of which serve as the de facto industry standard in AI. He is the creator or cocreator of Caffe, TensorFlow, Caffe2, ONNX, and PyTorch 1.0. Lately, he has been focused on the design and evolution of the AI hardware and software ecosystem and the combination of AI research and conventional wisdom of computer science.

Presentations

Keynote with Yangqing Jia 主题演讲 (Keynote)

Keynote with Yangqing Jia

Jing(Nicole) is a data scientist experienced with different machine learning/deep learning model and deals with big data and transform data/model into products and service that drive business.

Presentations

Real-time product recommendations leveraging deep learning on Apache Spark in Office Depot 40分钟议题 (40-minute session)

To show case how to build efficient recommender systems for e-commerce industry using deep learning technologies

Tim Kraska is an associate professor of electrical engineering and computer science at MIT’s Computer Science and Artificial Intelligence Laboratory. Currently, his research focuses on building systems for machine learning and using machine learning for systems. Tim spent the majority of 2017 at Google Research, where he invented the concept of learned index structures with the MLX and Brain teams. Tim was recently selected as a 2017 Alfred P. Sloan Research Fellow in computer science. He has also received the 2017 VMware Systems Research Award, NSF CAREER Award, an Air Force Young Investigator award, two Very Large Data Bases (VLDB) conference best demo awards, and a best paper award from the IEEE International Conference on Data Engineering (ICDE).

Presentations

Keynote by Tim Kraska 主题演讲 (Keynote)

Keynote by Tim Kraska

Abhishek Kumar is a senior manager of data science in Sapient’s Bangalore office, where he looks after scaling up the data science practice by applying machine learning and deep learning techniques to domains such as retail, ecommerce, marketing, and operations. Abhishek is an experienced data science professional and technical team lead specializing in building and managing data products from conceptualization to deployment phase and interested in solving challenging machine learning problems. Previously, he worked in the R&D center for the largest power-generation company in India on various machine learning projects involving predictive modeling, forecasting, optimization, and anomaly detection and led the center’s data science team in the development and deployment of data science-related projects in several thermal and solar power plant sites. Abhishek is a technical writer and blogger as well as a Pluralsight author and has created several data science courses. He is also a regular speaker at various national and international conferences and universities. Abhishek holds a master’s degree in information and data science from the University of California, Berkeley.

Presentations

Industrialized Capsule Networks for Text Analytics 40分钟议题 (40-minute session)

We illustrate how capsule networks can be industrialized: 1. Overview of capsule networks and how they help in handling spatial relationships between objects in an image. We also learn about how they can be applied to text analytics. 2. We show an implementation of recurrent capsule networks, which are useful in text analytics, especially for some tasks such as summarization or classification.

Chaoguang has been working in distributed systems for more than 10 years. He was working at IBM on the first generation of SSD tiered storage DS8000, then he was the chief architect of the all-flash storage Dorado Cache in Huawei. Currently he is the leading the deep learning platform at Qiniu.

Presentations

AVA: a Cloud-Native Deep Learning Platform at Qiniu 40分钟议题 (40-minute session)

Atlab Lab at Qiniu Cloud focuses on deep learning for computer vision. Our team has built a high-performance and cost-effective training platform based on Cloud for deep learning, called AVA, which deeply integrates open source software stack including Tensorflow, Caffe, Alluxio and KODO our own cloud object storage.

Zhichao Li is a senior software engineer at Intel focused on distributed machine learning, especially large-scale analytical applications and infrastructure on Spark. He’s also an active contributor to Spark. Previously, Zhichao worked in Morgan Stanley’s FX Department.

Presentations

Analytics Zoo: Distributed Tensorflow and Keras on Apache Spark 3小时辅导课 (3-hour Tutorial)

In this tutorial, we will show how to build and productionize deep learning applications for Big Data using "Analytics Zoo":https://github.com/intel-analytics/analytics-zoo (a unified analytics + AI platform that seamlessly unites Spark, TensorFlow, Keras and BigDL programs into an integrated pipeline) using real-world use cases (such as JD.com, MLSListings, World Bank, Baosight, Midea/KUKA, etc.)

Richard Liaw is a PhD student in the BAIR Lab and RISELab at UC Berkeley working with Joseph Gonzalez, Ion Stoica, and Ken Goldberg. He has worked on a variety of different areas, ranging from robotics to reinforcement learning to distributed systems. He is currently actively working on Ray, a distributed execution engine for AI applications; RLlib, a scalable reinforcement learning library; and Tune, a distributed framework for model training.

Presentations

Building reinforcement learning models and AI applications with Ray 3小时辅导课 (3-hour Tutorial)

Ray is a general purpose framework for programming your cluster. We will lead a deep dive into Ray, walking you through its API and system architecture and sharing application examples, including several state-of-the-art AI algorithms.

刘影,现任鲸算科技高级风控经理,集团科技教育品牌鲸小小联合创始人。武汉大学学士,加拿大纽芬兰纪念大学硕士,电子工程专业,海外读研期间当过研究员,曾在雷达信号处理领域发表过两篇SCI,参加过三次国际学术会议。2014年底毕业后在青岛墨尔文中学担任一年全英文数学老师,教L5-U6的中学生IGCSE/ALEVEL数学课,并组织了青少年编程俱乐部。2016年加入鲸算科技(原闪银Wecash),任职数据科学家一年,从事互联网金融信用评估特征工程搭建和线上模型研发。2017年至今,转岗为风控高级经理,致力于公司的数据管理工作,深度挖掘数据商业价值,与资方、财务、产品运营、催收、人事管理等团队紧密合作,设计AB实验,帮助大家为公司降本增效,希望把数据科学技术落地为更大更广的商业价值。

像所有其他从事数据科学的同行一样,渴望看见AI技术在更多领域落地,产生影响力。也想鼓励更多女性同行加入到这项激动人心的事业中来。在设计AI产品时,像”Design of Everyday Things”书中讲到的,在追求理性效率同时,倡导更多人文关怀,人性化地去解决问题,共同提高AI落地的方法论,造福人类未来生活。

Ying (Claire) Liu, a Senior Risk Management Manager at Abakus and co-founder of Abakus Kids (a new edTech brand). She received her B.Eng. in radio physics (radio wave propagation and antenna) at Wuhan University in 2007. She also completed a M.Eng. in electrical and computer engineering at Memorial University of Newfoundland in Canada in 2014.

*As a research student, she published 2 SCI papers and attended several international conferences in radar signal processing in 2012-2014.
*Back in China, she taught L5-U6 kids IGCSE/ALEVEL Math class and held a web development club at Malvern College Qingdao in 2015
*From 2016 to 2017, she worked as a data scientist in Abakus Group (Wecash China), from feature engineering to model deploy, witnessing AI technology accelerate online lending in China.
*So far, she assumed herself both a data officer and product manager, dived into the architecture of our data management platform, shared the knowledge of data in our company and spared no effort to help people do better on creating business value, collaborating closely with product, operation, finance and collection teams with data.
*Her second career is as a cofounder of education technology at Abakus Kids which is a startup founded in 2018.

PS, her LinkedIn profile is as follows:
https://www.linkedin.com/in/claire-ying-liu-28948086/

Presentations

A Humane AI Solution to Improve Debt Collection 40分钟议题 (40-minute session)

AI debt collection platform of Abakus provides a friendly and humane product solution which is designed for people who work in the live agents of the organization in the frontline. The agent training of the organization could be enhanced more smoothly with an AI friendly culture. It has been proved in our experiment that the performance of the collection assistants has been highly improved.

Ben Lorica is the chief data scientist at O’Reilly Media. Ben has applied business intelligence, data mining, machine learning, and statistical analysis in a variety of settings, including direct marketing, consumer and market research, targeted advertising, text mining, and financial engineering. His background includes stints with an investment management company, internet startups, and financial services.

Presentations

Friday opening remarks 主题演讲 (Keynote)

Opening keynote remarks by Program Chairs Ben Lorica, Jason Dai, and Roger Chen

Thursday opening remarks 主题演讲 (Keynote)

Opening keynote remarks by Program Chairs Ben Lorica, Jason Dai, and Roger Chen

David Low is currently the Co-founder and Chief Data Scientist at Pand.ai, building AI-powered chatbot to disrupt and shape the booming conversational commerce space with Deep Natural Language Processing. He represented Singapore and National University of Singapore (NUS) in Data Science Game’16 at France and clinched top spot among Asia and America teams. Recently David has been invited as a guest lecturer by NUS to conduct masterclasses on applied Machine Learning and Deep Learning topics. Prior to Pand.ai, he was a Data Scientist with Infocomm Development Authority (IDA) of Singapore.

Throughout his career, David has engaged in data science projects ranging from Manufacturing, Telco, E-commerce to Insurance industry. Some of his works including sales forecast modeling and influencer detection had won him awards in several competitions and was featured on IDA website and NUS publication. Earlier in his career, David was involved in research collaborations with Carnegie Mellon University (CMU) and Massachusetts Institute of Technology (MIT) on separate projects funded by National Research Foundation and SMART. As a pastime activity, he competed on Kaggle and achieved Top 0.2% worldwide ranking.

Presentations

The Unreasonable Effectiveness of Transfer Learning on NLP 40分钟议题 (40-minute session)

Transfer Learning has been proven to be a tremendous success in the Computer Vision field as a result of ImageNet competition. In the past months, the Natural Language Processing field has witnessed several breakthroughs with transfer learning, namely ELMo, Transformer, ULMFit and BERT. In this talk, David will be showcasing the use of transfer learning on NLP application with SOTA accuracy.

Tao Lu is a Data Scientist in the Cloud and AI organization at Microsoft. He has strong background in applying machine learning and deep learning techniques to forecasting problems. He has deep domain knowledge in cloud identity and financial services industry. He graduated from University of Washington with a master degree in Computational Finance.

Presentations

Forecasting Customer Activities with Dilated Convolution Neural Networks: Use Case and Best Practices 40分钟议题 (40-minute session)

Forecasting customer activities is one of the most important and common business problems. In Microsoft Azure Identity team, we forecast customer behavior based on billions of user activities. We will share how we improve 25% of forecasting accuracy with dilated convolutional neural networks and reduce 80% of the time in development with the best practices of time series forecasting.

Zhenxiao Luo is an engineering manager at Uber, where he runs the interactive analytics team. Previously, he led the development and operations of Presto at Netflix and worked on big data and Hadoop-related projects at Facebook, Cloudera, and Vertica. He holds a master’s degree from the University of Wisconsin-Madison and a bachelor’s degree from Fudan University.

Presentations

Query the planet: Geospatial big data analytics at Uber 40分钟议题 (40-minute session)

One of the distinct challenges for Uber is analyzing geospatial big data. Locations and trips provide insights that can improve business decisions and better serve users. Geospatial data analysis is particularly challenging, especially in a big data scenario. For these analytical requests, we must achieve efficiency, usability, and scalability in order to meet user needs and business requirements.

Founder, CEO and CTO.

David is a serial entrepreneur, with his most previous company – Hexatier/GreenSQL – acquired by Huawei. He was a founder of Precos, Vanadium-soft, GreenCloud, Teridion, Terrasic, and Re-Sec, among others.

Previously a director in Fortinet’s CTO office, he managed information security at Bezeq, the Israeli Telecom.

He has 24 years’ experience in leadership, AI, Cyber security, development and networking and is a veteran of an elite IDF unit.

Named one of the Top-40 Israeli Internet Startup Professionals by TheMarker Magazine and Top 40 under 40 most promising Israeli business professionals by Globes Magazine.

David holds a master’s in computer science from Open University.

Presentations

Hacking Humans Made Easy: Signal Processing + AI + Video 40分钟议题 (40-minute session)

Zero-day attacks. IoT-based botnets. Cybercriminal AI v. cyberdefender AI. While these won’t be going away, they aren’t the biggest worry we have in cybercrime. Hacking humans is. The combination of mere minutes of video, signal processing, remote heart rate monitoring, AI, machine learning, and data science can identify a person’s health vulnerabilities, which evildoers can make worse.

Aileen Nielsen works at an early-stage NYC startup that has something to do with time series data and neural networks. Previously, Aileen worked at corporate law firms, physics research labs, a variety of NYC tech startups, and most recently, the mobile health platform One Drop as well as on Hillary Clinton’s presidential campaign. Her interests range from defensive software engineering to UX designs for reducing cognitive load to the interplay between law and technology. She also serves as chair of the New York City Bar Association’s Science and Law Committee, which focuses on how the latest developments in science and computing should be regulated and how such developments should inform existing legal practices. Aileen is a frequent speaker at machine learning conferences on both technical and legal subjects.

Presentations

Deep prediction: A year in review for deep learning for time series 40分钟议题 (40-minute session)

Deep learning for time series analysis has made rapid progress in 2018 and 2019, with advances in the use of both convolutional and recurrent neural network architectures. The state of the art in deep forecasting will be summarized for 2018 and 2019, including use cases in both forecasting and generating time series.

Emma Ning is senior Program Manager in Microsoft Cloud&AI ML Platform team, focusing on AI model operationalization and acceleration with ONNX/ONNXRuntime in support of Microsoft’s strategic investment for open and interoperable AI. She had been driving search engine experience for more than 5 years and later on 2 years on exploring adoption of AI among various businesses. Emma holds a MS in computer science from Institute of Computing Technology, Chinese Academy of Sciences.

Presentations

ONNX:开放和互操作平台让AI无处不在(AI everywhere: Open and interoperable platform for AI with ONNX) 40分钟议题 (40-minute session)

An open and interoperable ecosystem enables you to choose the framework that's right for you, train at scale, and deploy to cloud and edge. ONNX provides a common format supported by many popular frameworks and hardware accelerators. This session provides an introduction to ONNX and its core concepts. The session will be delivered in English and Chinese jointly.

Richard Ott is a data scientist in residence at the Data Incubator, where he gets to combine his interest in data with his love of teaching. Previously, he was a data scientist and software engineer at Verizon. Rich holds a PhD in particle physics from the Massachusetts Institute of Technology, which he followed with postdoctoral research at the University of California, Davis.

Presentations

Deep Learning with PyTorch 2天培训 (2-day Training)

PyTorch is a machine learning library for Python that allows users to build deep neural networks with great flexibility. Its easy to use API and seamless use of GPUs make it a sought after tool for deep learning. This course will introduce the PyTorch workflow and demonstrate how to use it. Students will be equipped with the knowledge to build deep learning models using real-world datasets.

Vanja Paunic is a data scientist in the Algorithms and Data Science Group at Microsoft London. She works on building machine learning solutions with external companies utilizing Microsoft’s AI Cloud Platform. She holds a PhD in computer science with a focus on data mining in the biomedical domain from the University of Minnesota.

Presentations

基于深度学习的时间序列预测 (Deep Learning for Time Series Forecasting) 3小时辅导课 (3-hour Tutorial)

Almost every business today uses forecasting to make better decisions and allocate resources more effectively. Deep learning has achieved a lot of success in computer vision, text and speech processing, but has only recently been applied to time series forecasting. In this tutorial we show how and when to apply deep neural networks to time series forecasting. The tutorial will be in CHN and EN.

Dmitry Pechyoni is a senior data scientist in the Cloud AI Group at Microsoft, where he works on building end-to-end data science solutions in various domains, including retail, energy management, and predictive maintenance. Previously, he built machine learning models for display advertising Akamai and MediaMath. Dmitry holds a PhD in theoretical machine learning from the Technion – Israel Institute of Technology.

Presentations

基于深度学习的时间序列预测 (Deep Learning for Time Series Forecasting) 3小时辅导课 (3-hour Tutorial)

Almost every business today uses forecasting to make better decisions and allocate resources more effectively. Deep learning has achieved a lot of success in computer vision, text and speech processing, but has only recently been applied to time series forecasting. In this tutorial we show how and when to apply deep neural networks to time series forecasting. The tutorial will be in CHN and EN.

Mark has had an interest in machine learning and artificial intelligence since doing a Masters at University of Toronto in the 80s. Currently he works at IBM and is responsible for shepherding customers to a variety of database products, including IBM Integrated Analytics System, which includes a full-blow machine learning environment: DSX. Mark’s interests in machine learning include deep learning on structured data and NLP.

Presentations

Using deep learning and time-series forecasting to reduce transit delays 40分钟议题 (40-minute session)

Toronto is unique among North American cities for having a legacy streetcar network as an integral part of its transit system. This means streetcar delays are a major contributor to gridlock in the city. Using deep learning and time-series forecasting, we'll show how streetcar delays can be predicted... and prevented.

Sujatha Sagiraju is a Group Program Manager in the Azure Cloud & AI group. Her expertise is in building large scale distributed systems. Her latest mission is accelerating and democratizing Artificial Intelligence via Automated Machine Learning. She has been at Microsoft since 2001 in various roles including developer, program manager and capacity planner. Other interests – Sujatha is a diversity & inclusion champion at the Azure AI platform org and is passionate about recruiting, mentoring and growing diverse talent.

Presentations

通过自动化机器学习民主化和加速AI落地 (Democratizing and Accelerating AI through Automated Machine Learning) 3小时辅导课 (3-hour Tutorial)

Intelligent experiences powered by AI can seem like magic to users. Developing them, however, is pretty cumbersome involving a series of sequential and interconnected decisions along the way that are pretty time consuming. What if there was an automated service that identifies the best machine learning pipelines for a given problem/data? Automated machine learning does exactly that!

Kaz Sato is a staff developer advocate on the Cloud Platform team at Google, where he leads the developer advocacy team for machine learning and data analytics products such as TensorFlow, the Vision API, and BigQuery. Kaz has been leading and supporting developer communities for Google Cloud for over seven years. He is a frequent speaker at conferences, including Google I/O 2016, Hadoop Summit 2016 San Jose, Strata + Hadoop World 2016, and Google Next 2015 NYC and Tel Aviv, and has hosted FPGA meetups since 2013.

Presentations

ML Ops and Kubeflow Pipeline 40分钟议题 (40-minute session)

Creating an ML model is just a starting point. To bring the technology into production service, you need to solve various real-world issues such as: building a data pipeline for continuous training, automated validation of the model, version control of the model, scalable serving infra, and ongoing operation of the ML infra with monitoring and alerting.

Alejandro Saucedo is the Chief Scientist at The Institute for Ethical AI & Machine Learning. With over 10 years of software development experience, Alejandro has held technical leadership positions across hyper-growth scale-ups and tech giants including Eigen Technologies, Bloomberg LP and Hack Partners. Alejandro has a strong track record building multiple departments of machine learning engineers from scratch, and leading the delivery of numerous large-scale machine learning systems across the financial, insurance, legal, transport, manufacturing and construction sectors (in Europe, US and Latin America).

Presentations

A practical guide towards explainability and bias evaluation in machine learning 3小时辅导课 (3-hour Tutorial)

Undesired bias in machine learning has become a worrying topic due to the numerous high profile incidents. In this talk we demystify machine learning bias through a hands-on example. We'll be tasked to automate the loan approval process for a company, and introduce key tools and techniques from latest research that allow us to assess and mitigate undesired bias in our machine learning models.

Maulik Soneji is currently working as a Data Engineer at Gojek where he works with different parts of data pipelines for a hyper-growth startup. Outside of learning about mature data systems, he is interested in elasticsearch, golang and kubernetes.

Presentations

Using ML for personalizing Food Recommendations 40分钟议题 (40-minute session)

The story of how we prototyped the search framework that personalizes the restaurant search results by using ML to learn what constitutes a relevant restaurant given a user's purchasing history

Guoqiong Song is a senior deep learning software engineer of the big data technology team at Intel. She has a PhD degree in atmospheric and oceanic sciences from UCLA, with a focus on numerical modling and optimization. Her interest is in developing and optimizing distributed deep learning algorithms on spark

Presentations

Real-time product recommendations leveraging deep learning on Apache Spark in Office Depot 40分钟议题 (40-minute session)

To show case how to build efficient recommender systems for e-commerce industry using deep learning technologies

Joseph Spisak is the product manager for Facebook’s AI open source platform, including PyTorch and ONNX. Previously, he led AI partnerships and deep learning product at Amazon Web Services, where he and his team were dedicated to building tools and solutions to help democratize deep learning for the developer community and ultimately accelerate the development of deep learning-based applications. Joseph holds a bachelor’s degree in electrical engineering from Michigan State University and an MBA and MS in finance from the University of Denver. He is a proud graduate of the Entrepreneurial and Innovation Program at Stanford University’s Graduate School of Business.

Presentations

Bringing Research And Production Together With PyTorch 1.0 40分钟议题 (40-minute session)

Learn how PyTorch 1.0 enables you to take state-of-the-art research and deploy it quickly at scale in areas from autonomous vehicles to medical imaging. We'll deep dive on the latest updates to the PyTorch framework including TorchScript and the JIT compiler, deployment support, the C++ interface. We will also cover how PyTorch 1.0 is utilized at Facebook to power AI across a variety of products.

Ion Stoica is a professor in the EECS Department at the University of California, Berkeley, where he does research on cloud computing and networked computer systems. Ion’s previous work includes dynamic packet state, chord DHT, internet indirection infrastructure (i3), declarative networks, and large-scale systems, including Apache Spark, Apache Mesos, and Alluxio. He is the cofounder of Databricks—a startup to commercialize Apache Spark—and Conviva—a startup to commercialize technologies for large-scale video distribution. Ion is an ACM fellow and has received numerous awards, including inclusion in the SIGOPS Hall of Fame (2015), the SIGCOMM Test of Time Award (2011), and the ACM doctoral dissertation award (2001).

Presentations

Keynote with Ion Stoica 主题演讲 (Keynote)

Keynote with Ion Stoica

Angus Taylor is a data scientist in the Cloud AI Group at Microsoft, where he builds data science solutions for external customers in the retail, energy, engineering, and package distribution sectors. He holds an MSc in AI from the University of Edinburgh.

Presentations

基于深度学习的时间序列预测 (Deep Learning for Time Series Forecasting) 3小时辅导课 (3-hour Tutorial)

Almost every business today uses forecasting to make better decisions and allocate resources more effectively. Deep learning has achieved a lot of success in computer vision, text and speech processing, but has only recently been applied to time series forecasting. In this tutorial we show how and when to apply deep neural networks to time series forecasting. The tutorial will be in CHN and EN.

Arun joined the Bloomberg Quantitative Research group in 2003. Prior to that, he earned his Ph.D from Cornell University in computer science & applied mathematics and a B. Tech in Computer Science from IIT Delhi. At Bloomberg, Arun’s early work focused on Stochastic Volatility Models for Derivatives & Exotics pricing & hedging, e.g., Variance Swaps and VIX
Futures fair pricing and Cross-Currency Volatility Surface construction. More recently, he has enjoyed working at the intersection of diverse areas such as data science, innovative quantitative finance models across asset classes and using machine learning methods to help reveal embedded signals in traditional & alternative data that can be used to construct quantitative trading strategies.

Arun lives in central New Jersey with his lovely wife and two children. He also serves on the board of a non-profit that helps with humanitarian projects in India serving impoverished children and women in the areas of education and vocational training.

Presentations

Trading strategies using Alternative data and Machine Learning 40分钟议题 (40-minute session)

We illustrate use of AI and ML techniques in Quantitative finance that lead to profitable trading strategies. Passive investing (or Quantamental investing) is now very popular and many techniques from deep learning, reinforcement learning as well as NLP and sentiment analysis are being used for a broad set of data sets such as News and Geolocational data.

Jiao (Jennie) Wang is a software engineer on the big data technology team at Intel, where she works in the area of big data analytics. She is engaged in developing and optimizing distributed deep learning framework on Apache Spark.

Presentations

Real-time product recommendations leveraging deep learning on Apache Spark in Office Depot 40分钟议题 (40-minute session)

To show case how to build efficient recommender systems for e-commerce industry using deep learning technologies

Long currently takes in charge of the R&D of AI and big data products and services of Tencent Cloud. After receiving bachelor degree from Tsinghua University, he worked in China, Germany and the US for more than 18 years, serving mainly MNCs such as eBay, Siemens, VMware and Cheetah Mobile etc. He was founder or co-founder of several start-ups. Prior to his current role in Tencent,he was responsible for VMware’s flagship cloud management product – vRealize Automation, and content recommendation system in Cheetah Mobile.

Presentations

Keynote by Long Wang, VP Tencent Cloud 主题演讲 (Keynote)

Keynote by Long Wang, VP Tencent Cloud

Lu is a data scientist / big data engineer from OfficeDepot, where he works on machine learning and big data analytics. He is engaged in developing distributed machine learning applications and real-time web services for OfficeDepot digital business platform.

Presentations

Real-time product recommendations leveraging deep learning on Apache Spark in Office Depot 40分钟议题 (40-minute session)

To show case how to build efficient recommender systems for e-commerce industry using deep learning technologies

Tiezhen Wang

Senior Software Engineer in Google

Presentations

Exciting new features in TensorFlow 2.0 40分钟议题 (40-minute session)

TensorFlow 2.0 is a major milestone with a focus on ease of use. This talk will give a in depth introduction to the new exciting features and best practices. Topics such as distributed strategies and edge deployment (TensorFlow Lite and TensorFlow.js) will also be covered.

Yang Wang is a machine learning engineer in Intel Data Analytics team, focusing on deep learning infrastructure, algorithms and applications. He is one of the core contributors of Analytics-Zoo and BigDL.

Presentations

Analytics Zoo: Distributed TensorFlow in Production on Apache Spark 40分钟议题 (40-minute session)

We will introduce Analytics Zoo, a unified analytics + AI platform for distributed TensorFlow, Keras and BigDL on Apache Spark, designed for production environment. It enables easy deployment, high performance and efficient model serving for deep learning applications.

王奕恒是腾讯云的高级研发工程师,主要方向是分布式机器学习,尤其是基于Apache Spark构建大规模数据分析平台。他还是Apache Spark上深度学习框架BigDL的主要贡献者。奕恒之前工作于Intel和摩根士丹利。

Presentations

Sparkling: 基于Apache Spark进行一站式机器学习 40分钟议题 (40-minute session)

机器学习项目在企业中实际落地往往涉及到复杂工作流构建和数据管理,以及多种工具的整合。而且随着数据规模的增加,团队规模的扩大,这一任务更具挑战性。Apache Spark是业界流行的大数据框架,被广泛的应用在海量数据的分析处理。本议题将介绍我们在腾讯云上如何基于Apache Spark为客户建立一个一站式机器学习平台的相关工作。主要内容包括多种数据源的接入,构建复杂数据管线,利用数据可视化理解数据,通过可插拔的机制使用各种流行的机器学习框架,以及部署和监控模型。我们也会分享在这一过程中遇到的问题和挑战。听众也可以了解到,通过这种和大数据紧密结合的一站式机器学习,用户可以怎样更加高效的建立和管理他们的机器学习项目,从而加速了机器学习在业务中的落地。

Pete Warden is the technical lead of the mobile and embedded TensorFlow Group on Google’s Brain team.

Presentations

Keynote by Pete Warden 主题演讲 (Keynote)

Keynote by Pete Warden

Bichen Wu is a PhD candidate at UC Berkeley, where he focuses on deep learning, computer vision, and autonomous driving.

Presentations

Efficient Deep Learning for the Edge 40分钟议题 (40-minute session)

The success of deep neural networks is attributed to three factors: stronger computing capacity, more complex neural networks, and more data. These factors, however, are usually not available with the edge applications as autonomous driving, AR/VR, IoT, and so on. In this talk we discuss how we apply AutoML, SW/HW codesign, domain adaptation to solve these problems.

Mingxi Wu is the vice president of engineering at TigerGraph, a Silicon Valley-based startup building a world-leading real-time graph database. Over his career, Mingxi has focused on database research and data management software. Previously, he worked in Microsoft’s SQL Server group, Oracle’s Relational Database Optimizer group, and Turn Inc.’s Big Data Management group. Lately, his interest has turned to building an easy-to-use and highly expressive graph query language. He has won research awards from the most prestigious publication venues in database and data mining, including SIGMOD, KDD, and VLDB and has authored five US patents with three more international patents pending. Mingxi holds a PhD from the University of Florida, specializing in both database and data mining.

Presentations

非监督学习在大规模图谱上的案例应用和开源算法剖析 40分钟议题 (40-minute session)

图数据上的非监督学习在激活大数据的经济价值上有着广泛和不可替代的作用。 PageRank能够发掘重要的实体, 社区发掘(community detection)可以找到具有某种特性的群体,紧密度中心性算法(Closeness Centrality)可以自动找到远离群体的个体。所有这些算法都是非监督的学习。 我们分享一些具体客户案例来展示他们的价值,同时分享怎样在大数据上灵活应用这些开源算法。

夏磊先生, 现任英特尔中国人工智能技术架构师,服务于英特尔数据中心技术销售部。专注于为客户在应用人工智能前沿技术过程中为客户的创新提供技术建议与指导提供,并提供英特尔产品与技术相关的支持。
夏磊先生于2000年加入英特尔,历任网络系统工程师、客户技术经理、渠道技术总监、云计算方案架构师、物联网端到端方案架构师,支持了国内信息产业在在互联网、数据中心、云计算与物联网术时代的持续技术创新。
夏磊先生获有机器人工程学士学位。在加入英特尔前任职于政府与教育行业的不同的技术开发和技术教育岗位,在软件算法、自动控制及工程管理等领域具有丰富经验。

Presentations

Low precision inference on Intel Architecture 40分钟议题 (40-minute session)

Vector Neural Network Instructions or VNNI is the new Intel instruction set for low precision AI inference inside next generation Xeon platform. This lecture is to introduce the features of the VNNI and Intel software tools to support developers to use this new instruction set to accelerate inference with INT8.

Vincent Xie (谢巍盛) is the Chief Scientist and Director of China Telecom BestPay Co., Ltd. He builds the company’s Artificial Intelligence Group and leads the team to carry out research related to big data and A.I. Previously, he worked for Intel leading an engineering team working on machine learning- and big data-related open source technologies.

Presentations

How China Telecom combats financial frauds with Adversarial AutoEncoder? 40分钟议题 (40-minute session)

We exploit the good representation capability of AAE (Adversarial AutoEncoder) in our risk factors modeling in fighting a special kind of financial frauds. It's one step of our long stack of unsupervised tasks, yet it's proved to be efficient and effective in our practice.

Hui Xue is currently an Associate Researcher in System Group, Microsoft Research Asia (MSRA). She obtained her master degree majoring in Natural Language Processing in July, 2016, from Peking University. Her interests in automated machine learning(AutoML), deep learning and natural language processing, especially their applications for chat-bot.

https://www.microsoft.com/en-us/research/people/xuehui/?preview=true&preview_nonce=b031f0fc93

Presentations

自动机器学习(automated machine learning)技术的实践与应用 40分钟议题 (40-minute session)

人工智能在过去的几年里飞速发展,但是机器学习的实践和应用需要消耗一定的人力和时间。例如,如何去做特征选择,如何设计一个适合该任务的神经网络模型等等。而自动机器学习技术,可以帮助开发者和机器学习实战者,缩短开发周期,提高效率。我们的介绍主要包括:自动机器学习技术的进展;我们开源的自动机器学习开源库Neural Network Intelligence; 如何利用自动机器学习的技术,在产品和应用上提高效率,节省所需的时间和缩短周期。我们会在最后一部分,分享一些利用自动特征选择,自动参数调整以及模型架构搜索上的成功案例。

Season Yang is an analytics fellow in McKinsey & Company’s Risk Practice. Previously, Season was a data scientist in residence at the Data Incubator, where he also contributes to curriculum development and instruction and worked at NASA’s Goddard space center, where he studied climate change models with data analysis. Season holds a double Bachelor’s degree in applied mathematics and scientific computation and economics from UC Davis, and a Master’s in applied mathematics from Columbia, specializing in numerical computation.

Presentations

Deep Learning with TensorFlow 2天培训 (2-day Training)

The TensorFlow library provides for the use of computational graphs, with automatic parallelization across resources. This architecture is ideal for implementing neural networks. This training will introduce TensorFlow's capabilities in Python. It will move from building machine learning algorithms piece by piece to using the Keras API provided by TensorFlow with several hands-on applications.

袁理 深圳普思英察科技有限公司 项目及产品总监

袁理拥有AI行业及金融IT行业工作10多年经验,2006年加入汇丰银行环球技术中心。2013年袁理作为汇丰银行风控部门对公信贷风险业务的资深技术架构师及IT项目经理,主要带领印度,中国及香港团队及协调美国、英国、法国团队支持汇丰银行核心及风控等系统研发升级、自动化和敏捷转型以及云端移植可行性探索,2017年袁理加入普思英察至今主要负责AI及无人车行业产品及项目落地以及解决方案预研及商业模式设定等主要工作。

Presentations

自动驾驶技术是如何应用于新潮传媒、新零售行业 40分钟议题 (40-minute session)

如何令自动驾驶技术落地并结合新潮传媒以及新零售业务,相关的技术是如何实现,商业模式是什么以及如何通过人工只能技术提升行业的效率。

Henry Zeng is a principal program manager in the Cloud AI Group at Microsoft, where he works with engineering team, partners and customers to ensure the success of ML platform. He has been in AI and data area for more than 10 years from database, NoSQL, Hadoop ecosystem, machine learning to deep learning. Prior to this role, he was the lead AI solution architect in Microsoft China working with partners and customer to land AI solutions in manufactory, retail, education and public service etc with Microsoft AI offerings. Henry holds a MS in computer science from Wuhan University.

Presentations

ONNX:开放和互操作平台让AI无处不在(AI everywhere: Open and interoperable platform for AI with ONNX) 40分钟议题 (40-minute session)

An open and interoperable ecosystem enables you to choose the framework that's right for you, train at scale, and deploy to cloud and edge. ONNX provides a common format supported by many popular frameworks and hardware accelerators. This session provides an introduction to ONNX and its core concepts. The session will be delivered in English and Chinese jointly.

基于深度学习的时间序列预测 (Deep Learning for Time Series Forecasting) 3小时辅导课 (3-hour Tutorial)

Almost every business today uses forecasting to make better decisions and allocate resources more effectively. Deep learning has achieved a lot of success in computer vision, text and speech processing, but has only recently been applied to time series forecasting. In this tutorial we show how and when to apply deep neural networks to time series forecasting. The tutorial will be in CHN and EN.

通过自动化机器学习民主化和加速AI落地 (Democratizing and Accelerating AI through Automated Machine Learning) 3小时辅导课 (3-hour Tutorial)

Intelligent experiences powered by AI can seem like magic to users. Developing them, however, is pretty cumbersome involving a series of sequential and interconnected decisions along the way that are pretty time consuming. What if there was an automated service that identifies the best machine learning pipelines for a given problem/data? Automated machine learning does exactly that!

Alina Zhang is Data Scientist at Skylinerunners Corporation and certified as Google Cloud Professional Data Engineer. She has authored [articles](https://medium.com/@alina.li.zhang) on Machine Learning, Exploratory Data Analysis, Data Visualization, etc.
Alina is driving Skylinerunners to provide small business with AI solutions. She applies Machine Learning models on user behavior analysis, recommendation system, and time series forecasting.
Before joining Skylinerunners, Alina was data scientist in Nobul. She was driving Nobul to evolve real estate in the cloud with Machine Learning technology to a variety of problems including property listing prediction, real estate chatbot with natural language processing, customer’s behavioral clustering, etc.
She worked for IBM as a software developer and WLM component owner of IBM DB2. Alina holds a Master Degree in Computer Science from Western University, where her research focused on high performance computing and Truncated Fourier Transform.

Presentations

Using deep learning and time-series forecasting to reduce transit delays 40分钟议题 (40-minute session)

Toronto is unique among North American cities for having a legacy streetcar network as an integral part of its transit system. This means streetcar delays are a major contributor to gridlock in the city. Using deep learning and time-series forecasting, we'll show how streetcar delays can be predicted... and prevented.

Maria is Vice President of Engineering for LinkedIn Talent Solutions (LTS) and Careers, which helps recruiters connect with quality talent and connects job-seekers with opportunity. Prior to LinkedIn, Maria served as CTO at Tinder, where she built a world-class team of engineers and scaled the app to serve a rapidly growing global user base. Maria also worked as Vice President of Engineering for Yahoo Mobile, and managed teams at Microsoft, Zillow.com and NetIQ Corp. She also founded Alike, a mobile local recommendation app, which was later acquired by Yahoo. She studied Computer Science at Tsinghua University and graduated Eastern Michigan University with a Bachelor and Master’s degree.

Presentations

Keynote by Maria Zhang 主题演讲 (Keynote)

Keynote by Maria Zhang

Provide technical consulting and training for Intel® AI software solutions, including Intel® MKL/MKL-DNN, Intel® OpenVINO™ , and Intel Performance libraries (IPP/MKL/DAAL) to Intel strategic customers in Asia-Pacific, enabling Intel internal and external customers to be successful with Intel platform through use Intel Software Technology and Products.

Presentations

Intel OpenVINO: Accelerating Deep learning inference and computer vision from edge to cloud 3小时辅导课 (3-hour Tutorial)

How Intel OpenVINO provides highly optimized cross-platform Deep learning deployment and visual AI solution based on various Intel architectures. And the structure and workflow of Intel OpenVINO™ toolkit, optimization methods by Asynchronies & heterogeneous computing, low precision inference, instruction set acceleration.

Weiqiang Zhuang is currently a senior software engineer in IBM’s Open Source Data and AI group with focus on building a cloud native pipeline solution for AI workflow. He was also the tech lead of the BigR machine learning project built on top of Hadoop. He has code contributions to Apache Spark, mlflow, Kubeflow, Apache SystemML and R4ML. He was also one of the core engineers for DB2’s process model component.

Presentations

AI Pipelines on container platform 40分钟议题 (40-minute session)

AI pipelines simplifies the lifecycle workflow management and enhances the reproducibility and collaboration for machine learning/deep learning. A cloud native platform solution is great at portability and scalability. Combining both strengths, AI pipelines on container platform can help accelerate both AI applications development and deployment.

刘怀军

美团研究员,美团外卖个性化技术负责人,负责外卖个性化搜索、排序和推荐工作。曾为腾讯搭建公司第一个智能反垃圾系统和智能问答系统,并负责搜搜查询分析,微信智能对话系统和微信搜索算法团队。发明专利20多篇,大部分已经授权。任中文信息学会社会媒体处理专委。

Presentations

AI技术在外卖个性化场景中的落地与思考 40分钟议题 (40-minute session)

该议题的内容包括: 1.外卖个性化场景:个性化搜索,个性化推荐 2.个性化产品形态包括:商家、商品、套餐等 3.外卖个性化中应用的AI技术包括:NLP,DNN,图像技术,强化学习 4.针对外卖业务的特点,介绍个性化场景中,几项重点AI技术的落地、挑战与思考

刘祁跃,爱奇艺智能平台部视频分析负责人,负责视频分析相关算法,包括短视频标签、行为识别、场景识别、目标检测、台词分析、音频分类等,以及视频精彩度分析和智能创作

Presentations

视频精彩度分析及智能创作 40分钟议题 (40-minute session)

对视频进行精彩度分析,有助于筛选优质内容,尤其是冷启动阶段 同时,基于算法对精彩内容的理解,可以辅助创作,如进行标题辅助生成、动态/精彩封面生成、智能拆条等 我们通过对视频、音频、文本等多模态内容分析,同时利用用户交互数据,建立了完备的视频精彩度分析系统,并落地在长/短视频的不同业务场景下,明显提升了业务产出质量和效率

姜涛,音乐检索(MIR)技术专家,有多年从业经验。

Presentations

AI“美颜”你的歌声和视频:K歌修音和自动作曲 40分钟议题 (40-minute session)

介绍如何综合应用多项人工智能技术进行K歌修音和短视频自动配乐,涉及的相关技术包括:人声/音乐分离、高精度的基频提取、自动作曲/作词技术、基于视频内容的音乐生成等。

He is currently working at Rakuten as data engineer and in charge of building the data science platform.

Presentations

Best practice of building data science platform in Rakuten 40分钟议题 (40-minute session)

Data Science Platform is a suite of tools for exploring data, training models, and running GPU/CPU compute jobs in an isolated container environment. It provides one click machine learning environment creation, powerful job scheduler and flexible "function as a service" component. It runs on Kubernetes and supports both on-premises and cloud environment, as well as hybrid mode.

中国地质大学北京与中国地质科学院联培在读研究生,研究课题是深度学习在地质学上的相关应用!

Presentations

基于目标检测的智能化成矿异常信息提取 40分钟议题 (40-minute session)

矿床所在的位置往往伴随着地质、地球物理、地球化学、遥感异常,因此,这些异常所在的位置也往往伴随着矿床的存在。所以,在找矿工作当中,一个重要的过程便是在地、物、化、遥数据中寻找异常,并将其整合,得出该区域成矿的概率,从而推断出靶区所在的位置。但传统方法并未考虑空间中点与点之间的相关关系。而卷积神经网络中的卷积和池化方法,充分考虑了点与点之间的相关关系。但单纯使用卷积神经网络只能进行特征提取,不能圈定异常所在的区域。因此,特将目标检测的相关算法引入其中,从而圈定异常所在的区域。

目前在阿里巴巴计算平台事业部PAI团队负责大规模深度学习算法基础设施相关建设工作,对大规模分布式机器学习的开发、建设、优化以及在不同业务场景中的落地应用有较为深入的理解和认识。之前先后在奇虎360担当广告技术部门架构师,Yahoo北京研发中心担当效果广告系统技术负责人。

Presentations

PAI Tensor Accelerator and Optimizer---Yet Another Deep Learning Compiler 40分钟议题 (40-minute session)

本次演讲会介绍阿里计算平台PAI团队过去一年多时间里在深度学习编译器领域的技术工作进展----PAI TAO(Tensor Accelerator and Optimizer)。PAI-TAO采用通用编译优化技术,来解决PAI平台所承载的多样性AI workload面临的训练及推理需求的性能优化问题,在部分workload上获得了20%到4X不等的显著加速效果,并且基本作到用户层全透明,在显著提升平台效率性能的同时也有效照顾了用户的使用惯性。目前PAI-TAO已经先后用于支持阿里内部搜索、推荐、图像、文本等多个业务场景的日常训练及推理需求。

杨博理,现任宜信大数据创新中心首席量化科学家,负责宜信线上财富管理平台上的量化投资策略研发、财务规划系统构建、以及AI在财富管理应用层面上的探索。华中科技大学博士后、博士,剑桥大学联合培养博士,里昂高等商学院访问学者。《量化炼金术——中低频量化交易策略研发》一书的作者。

Presentations

线上财富管理领域中的AI应用 40分钟议题 (40-minute session)

AI技术是线上财富管理领域中不可或缺的一环。在这个演讲中,我会将财富管理进一步细分为投资和实现财务目标两个方面,并分别讲解AI技术在这两个细分层面上的应用问题。对于投资而言,一些具备强金融逻辑的变量可能更适合使用机器学习进行预测。而在资产价格的预测上,可以尝试使用AI和大数据技术获取更多的有价值信息。对于实现财务目标而言,基于NLP技术的语义理解、引导式对话是理解用户的关键,基于AI和大数据的KYC也是判断用户状态的有效工具,而一个融合了财务规划、投资和精算知识的专家系统则是定制级规划的核心。

温浩,云从科技联合创始人。2003年获得中国科大电子科学与技术专业学士,并保送中国科大中科院量子信息重点实验室硕博连读,师从“量子调控”973首席科学家郭光灿院士,专攻量子通信器件和网络方向。2008年获得中国科大通信与信息系统博士学位,2014年加入中国科学院重庆绿色智能技术研究院。2015年和周曦博士共同创立云从科技。

Presentations

打造A.I.闭环 引领产业变革 40分钟议题 (40-minute session)

AI企业发展应该是一个从学术研究、行业验证、商业落地、行业平台到智能生态的一层层深入过程,这也是人工智能企业理想的发展阶段。 云从科技计划打造核心技术闭环,让计算机更好地服务人类。并将全面降低人工智能准入门槛,让“AI普惠”成为可能。

王书浩是透彻影像的联合创始人、技术总监,博士毕业于清华大学,清华大学交叉信息研究院博士后、助理研究员,曾于百度、NovuMind(异构智能)、京东从事人工智能研究,于EuroSys、ECML等会议发表多篇学术论文。

王书浩有着多年的人工智能实践经历,对深度学习有深入的研究,同时对深度学习在大规模集群的实施具有丰富的经验。

Presentations

人工智能病理影像辅助诊断系统——从方法到落地 40分钟议题 (40-minute session)

病理学是医学诊断的“金标准”,病理报告对于临床医生提供进一步治疗策略至关重要。一位能够独立发病理报告的病理医师需要10年以上的培养周期,我国目前共有约1万名注册在案的病理医师,根据WHO的要求,人才缺口为4-9万人。使用人工智能来辅助病理医师对样本进行诊断,不仅能够大幅提高医师的诊断效率,而且可以减少漏诊,提高诊断准确率。数字化的病理影像能够观察到组织的细胞形态,在最高倍数字扫描时,文件尺寸达到GB量级,需要从人工智能和系统工程的层面去应对这些挑战。在这个演讲中,我们将从人工智能系统的构建方法入手,介绍透彻影像与中国人民解放军总医院在消化道病理影像辅助系统研发过程中的技术细节。同时,我们将分享诊断系统从部署到落地使用的一些经验。

Dr. Yurong Chen is a Principle Research Scientist and Sr. Research Director at Intel Corporation, and Director of Cognitive Computing Lab at Intel Labs China. Currently, he’s responsible for driving cutting-edge Visual Cognition and Machine Learning research for Intel smart computing. He is also the co-owner of Intel Labs “Visual Understanding and Synthesis” program, driving research innovation in smart visual data processing technologies on Intel platforms across Intel Labs. He drove the research and development of Deep Learning (DL) based Visual Understanding (VU) and leading Face Analysis technologies to impact Intel architectures/platforms and delivered core technologies to help differentiate Intel products including Intel RealSense SDK, CV SDK, IOT video E2E analytics solutions and client apps. He led the team to win Intel China Award (Top team award of Intel China) 2016, Intel Labs Academic Awards (Top award of Intel labs) – Gordy Award 2016, 2015 and 2014 for outstanding research achievements on DL based VU, Multimodal Emotion Recognition and Advanced Visual Analytics. Dr. Chen joined Intel in 2004 after finishing his postdoctoral research in the Institute of Software, CAS. He received his Ph.D. degree from Tsinghua University in 2002. He has published over 50 technical papers, and holds 10+ issued/pending US/PCT patents and 30+ patent applications.

Presentations

在边缘实现深度学习 40分钟议题 (40-minute session)

深度学习在许多领域尤其是视觉识别/理解方面取得了巨大突破,但它在训练和部署方面都存在一些挑战。本讲座将介绍我们通过高效CNN算法设计、领先DNN模型压缩技术和创新部署时DNN网络结构优化来解决深度学习部署挑战的前沿研究成果。

陈薇博士,现任排列科技首席科学家,江西互联网金融协会特聘风控专家,博金贷金融科技研究院院长。
之前,陈薇曾任职于Lendingclub (NYSE:LC) 任首席数据科学家,负责风险管理相关技术创新,开创性将机器学习与文本数据挖掘系统引入P2P贷款风险分析,取得非常良好的效果,并极大缩短了研发周期,主导的非传统风险模型与决策算法的研究与开发,使公司风控水准远高于美国传统银行。再之前,陈薇曾任Paypal(NYSE:PYPL)主任信贷分析师,专注线上交易风险识别和分析,尤其是银行交易的风险分析和建模设计,创新性将大数据,人工智能和机器学习运用于风险识别和决策。持有内布拉斯加大学计算机科学系博士学位,清华大学计算机工程系硕士及中国人工智能重点实验室成员,曾担任数个学术期刊评审,发表专业论文数十篇。

Presentations

量化互联网金融信用与反欺诈风控 2天培训 (2-day Training)

您想了解金融企业是怎样利用大数据和人工智能技术来画像个人行为并检测欺诈用户的吗?互联网金融幕后的量化分析流程是怎么杨的?个人信用是怎样通过大数据被量化的?在实践过程中,机器学习算法的应用存在着哪些需要关注的方面?怎样通过图谱分析来融合多维数据,为我们区分正常用户和欺诈用户? 这套辅导课基于清华大学交叉信息研究院开设的一门"量化金融信用与风控分析”研究生课。其中会用LendingClub的真实借贷数据做为案例,解说一些具体模型的实现。

中国人寿研发中心高级工程师,自2014年从事大数据相关项目开发及管理。2016年开始研究机器学习模型的构建与实施,已主导多个模型落地实施。

Presentations

保险中的机器学习实践 40分钟议题 (40-minute session)

分析保险行业人工智能发展情况及现有数据特性,评估机器学习模型构建的主流工具、语言、算法。总结基于机器学习技术,实现一个保险业人工智能场景的全流程——从场景研讨、数据加工提取到模型构建、模型效果评估、模型落地实施。以一个真实的机器学习模型项目为例,介绍整个方法论不同环节中各方人员的参与工作内容和比例,探讨特征稳定性、样本不均衡、参数选择、模型可解释性等环节的难点及尝试方案。为金融或者其他行业的机器学习项目落地提供参考和指导。

黄铃,慧安金科(北京)科技有限公司创始人、CEO,清华大学交叉信息研究院兼职教授。主要技术背景是人工智能、信息安全和金融风控。他是全球为数不多的同时精通人工智能和计算机安全的顶级专家,在美国加州大学伯克利分校获得计算机科学博士 (2002-2007),师从 Anthony Joseph 和 Michael Jordan ,从事机器学习算法研究以及计算机网络建模应用。他是美国硅谷著名的反欺诈公司DataVisor的创始成员和大数据总监 (2014-1016),主持了公司整个机器学习,用户行为分析和信用分析系统。他在美国英特尔研究院任资深科学家七年(2007-2014),和 Intel McAfee 开展多个合作项目,应用人工智能技术解决网络和数据安全问题。他在人工智能,大数据分析和金融科技相关领域有近十五年的研究和开发背景,在世界顶尖会议上发表近50篇论文,在 Google Scholar 上总引用已超过5,000次。

Presentations

量化互联网金融信用与反欺诈风控 2天培训 (2-day Training)

您想了解金融企业是怎样利用大数据和人工智能技术来画像个人行为并检测欺诈用户的吗?互联网金融幕后的量化分析流程是怎么杨的?个人信用是怎样通过大数据被量化的?在实践过程中,机器学习算法的应用存在着哪些需要关注的方面?怎样通过图谱分析来融合多维数据,为我们区分正常用户和欺诈用户? 这套辅导课基于清华大学交叉信息研究院开设的一门"量化金融信用与风控分析”研究生课。其中会用LendingClub的真实借贷数据做为案例,解说一些具体模型的实现。

目前在阿里巴巴PAI团队负责GPU底层核心优化工作,之前在中科院软件所从事计算机系统结构相关研究工作,对高性能计算、微处理器设计、异构计算领域有较深入的理解和认识,先后有多篇论文在PPoPP、Micro、ACL等体系结构及AI领域顶级会议发表。

Presentations

PAI Tensor Accelerator and Optimizer---Yet Another Deep Learning Compiler 40分钟议题 (40-minute session)

本次演讲会介绍阿里计算平台PAI团队过去一年多时间里在深度学习编译器领域的技术工作进展----PAI TAO(Tensor Accelerator and Optimizer)。PAI-TAO采用通用编译优化技术,来解决PAI平台所承载的多样性AI workload面临的训练及推理需求的性能优化问题,在部分workload上获得了20%到4X不等的显著加速效果,并且基本作到用户层全透明,在显著提升平台效率性能的同时也有效照顾了用户的使用惯性。目前PAI-TAO已经先后用于支持阿里内部搜索、推荐、图像、文本等多个业务场景的日常训练及推理需求。