Оцените презентацию от 1 до 5 баллов!
Тип файла:
ppt / pptx (powerpoint)
Всего слайдов:
35 слайдов
Для класса:
1,2,3,4,5,6,7,8,9,10,11
Размер файла:
6.43 MB
Просмотров:
63
Скачиваний:
0
Автор:
неизвестен
Слайды и текст к этой презентации:
№1 слайд![High Performance Deep](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img0.jpg)
Содержание слайда: High Performance Deep Learning on Intel® Architecture
Ivan Kuzmin
Engineering Manager for AI Performance Libraries
December 19, 2016
№2 слайд![Fast Evolution of Technology](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img1.jpg)
Содержание слайда: Fast Evolution of Technology
№3 слайд![Classical Machine Learning](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img2.jpg)
Содержание слайда: Classical Machine Learning
№4 слайд![Deep learning](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img3.jpg)
Содержание слайда: Deep learning
№5 слайд![End-to-End Deep Learning](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img4.jpg)
Содержание слайда: End-to-End Deep Learning
№6 слайд![Automating previously human](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img5.jpg)
Содержание слайда: Automating previously “human” tasks
№7 слайд![Deep Learning Challenges](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img6.jpg)
Содержание слайда: Deep Learning Challenges
№8 слайд![Deep Learning Challenges](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img7.jpg)
Содержание слайда: Deep Learning Challenges
№9 слайд![Scaling is I O Bound](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img8.jpg)
Содержание слайда: Scaling is I/O Bound
№10 слайд![Intel Provides the Compute](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img9.jpg)
Содержание слайда: Intel Provides the Compute Foundation for DL
№11 слайд![INTEL MKL-DNN](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img10.jpg)
Содержание слайда: INTEL® MKL-DNN
№12 слайд![Deep learning with Intel](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img11.jpg)
Содержание слайда: Deep learning with Intel® MKL-DNN
№13 слайд![Deep learning with Intel](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img12.jpg)
Содержание слайда: Deep learning with Intel® MKL-DNN
№14 слайд![Deep learning with Intel](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img13.jpg)
Содержание слайда: Deep learning with Intel® MKL-DNN
№15 слайд![Intel Xeon Phi processor up](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img14.jpg)
Содержание слайда: Intel® Xeon Phi ™ processor 7250 up to 400x performance increase with Intel Optimized Frameworks compared to baseline out of box performance
№16 слайд![Intel Xeon Phi processor](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img15.jpg)
Содержание слайда: Intel® Xeon Phi ™ processor Knights Mill up to 4x estimated performance improvement over Intel® Xeon Phi™ processor 7290
№17 слайд![INTEL Machine Learning](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img16.jpg)
Содержание слайда: INTEL® Machine Learning Scaling Library
№18 слайд![Intel Machine Learning](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img17.jpg)
Содержание слайда: Intel® Machine Learning Scaling Library (MLSL)
Deep learning abstraction of message-passing implementations.
Built on top of MPI, allows other communication libraries to be used
Optimized to drive scalability of communication patterns
Works across various interconnects: Intel® Omni-Path Architecture, InfiniBand, and Ethernet
Common API to support Deep Learning frameworks (Caffe, Theano, Torch etc.)
№19 слайд![Intel Xeon Phi Processor](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img18.jpg)
Содержание слайда: Intel® Xeon Phi™ Processor 7250 GoogleNet V1 Time-To-Train Scaling Efficiency up to 97% on 32 nodes
№20 слайд![NeON framework](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img19.jpg)
Содержание слайда: NeON framework
№21 слайд![Neon DL Framework with](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img20.jpg)
Содержание слайда: Neon: DL Framework with Blazing Performance
№22 слайд![Intel Nervana Graph Compiler](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img21.jpg)
Содержание слайда: Intel® Nervana™ Graph Compiler
Intel® Nervana™ Graph Compiler:
High-level execution graph
for neural networks to enable
optimizations that are applicable
across multiple HW targets.
№23 слайд![Intel Nervana Graph Compiler](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img22.jpg)
Содержание слайда: Intel® Nervana™ Graph Compiler as the performance building block…
№24 слайд![INTEL DEEP Learning SDK](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img23.jpg)
Содержание слайда: INTEL® DEEP Learning SDK
№25 слайд![Intel Deep Learning SDK](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img24.jpg)
Содержание слайда: Intel® Deep Learning SDK
Accelerate Your Deep Learning Solution
A free set of tools for data scientists and software developers to develop, train, and deploy deep learning solutions
№26 слайд![Deep Learning Training Tool](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img25.jpg)
Содержание слайда: Deep Learning Training Tool
Intel® Deep Learning SDK
Simplify installation of Intel optimized Deep Learning Frameworks
Easy and Visual way to Set-up, Tune and Run Deep Learning Algorithms:
Create training dataset
Design model with automatically optimized hyper-parameters
Launch and monitor training of multiple candidate models
Visualize training performance and accuracy
№27 слайд![Deep Learning Deployment Tool](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img26.jpg)
Содержание слайда: Deep Learning Deployment Tool
Intel® Deep Learning SDK
Unleash fast scoring performance on Intel products while abstracting the HW from developers
Imports trained models from all popular DL framework regardless of training HW
Compresses model for improved execution, storage & transmission (pruning, quantization)
Generate Inference HW-Specific Code (C/C++, OpenVX, OpenCL, etc.)
Enables seamless integration with full system / application software stack
№28 слайд![Deep Learning Tools for](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img27.jpg)
Содержание слайда: Deep Learning Tools for End-to-End Workflow
Intel® Deep Learning SDK
№29 слайд![Leading AI research](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img28.jpg)
Содержание слайда: Leading AI research
№30 слайд![Summary Intel provides highly](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img29.jpg)
Содержание слайда: Summary
Intel provides highly optimized libraries to accelerate all DL frameworks
Intel® Machine Learning Scaling Library (MLSL) allow to scale DL to 32 nodes and beyond
Nervana graph compiler, next innovation for DL performance
Intel® Deep Learning SDK make it easy for you to start exploring DeepLearning
Intel is committed to provide algorithmic, SW and HW innovations to get best performance for DL on IA
Get more details at:
https://software.intel.com/en-us/ai/deep-learning
№31 слайд![Legal Disclaimer amp](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img30.jpg)
Содержание слайда: Legal Disclaimer & Optimization Notice
INFORMATION IN THIS DOCUMENT IS PROVIDED “AS IS”. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. INTEL ASSUMES NO LIABILITY WHATSOEVER AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO THIS INFORMATION INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT.
Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products.
Copyright © 2016, Intel Corporation. All rights reserved. Intel, Pentium, Xeon, Xeon Phi, Core, VTune, Cilk, and the Intel logo are trademarks of Intel Corporation in the U.S. and other countries.
№32 слайд![](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img31.jpg)
№33 слайд![Configuration details](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img32.jpg)
Содержание слайда: Configuration details
BASELINE: Caffe Out Of the Box, Intel® Xeon Phi™ processor 7250 (68 Cores, 1.4 GHz, 16GB MCDRAM: cache mode), 96GB memory, Centos 7.2 based on Red Hat* Enterprise Linux 7.2, BVLC-Caffe: https://github.com/BVLC/caffe, with OpenBLAS, Relative performance 1.0
NEW: Caffe: Intel® Xeon Phi™ processor 7250 (68 Cores, 1.4 GHz, 16GB MCDRAM: cache mode), 96GB memory, Centos 7.2 based on Red Hat* Enterprise Linux 7.2, Intel® Caffe: : https://github.com/intel/caffe based on BVLC Caffe as of Jul 16, 2016, MKL GOLD UPDATE1, Relative performance up to 400x
AlexNet used for both configuration as per https://papers.nips.cc/paper/4824-Large image database-classification-with-deep-convolutional-neural-networks.pdf, Batch Size: 256
№34 слайд![Configuration details](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img33.jpg)
Содержание слайда: Configuration details
BASELINE: Intel® Xeon Phi™ Processor 7290 (16GB, 1.50 GHz, 72 core) with 192 GB Total Memory on Red Hat Enterprise Linux* 6.7 kernel 2.6.32-573 using MKL 11.3 Update 4, Relative performance 1.0
NEW: Intel® Xeon phi™ processor family – Knights Mill, Relative performance up to 4x
№35 слайд![Configuration details](/documents_6/40fb8b4fae54a3342e64ce0a577618b4/img34.jpg)
Содержание слайда: Configuration details