Презентация Big data concepts and tools онлайн

На нашем сайте вы можете скачать и просмотреть онлайн доклад-презентацию на тему Big data concepts and tools абсолютно бесплатно. Урок-презентация на эту тему содержит всего 16 слайдов. Все материалы созданы в программе PowerPoint и имеют формат ppt или же pptx. Материалы и темы для презентаций взяты из открытых источников и загружены их авторами, за качество и достоверность информации в них администрация сайта не отвечает, все права принадлежат их создателям. Если вы нашли то, что искали, отблагодарите авторов - поделитесь ссылкой в социальных сетях, а наш сайт добавьте в закладки.
Презентации » Информатика » Big data concepts and tools



Оцените!
Оцените презентацию от 1 до 5 баллов!
  • Тип файла:
    ppt / pptx (powerpoint)
  • Всего слайдов:
    16 слайдов
  • Для класса:
    1,2,3,4,5,6,7,8,9,10,11
  • Размер файла:
    4.51 MB
  • Просмотров:
    148
  • Скачиваний:
    2
  • Автор:
    неизвестен



Слайды и текст к этой презентации:

№1 слайд
big data concepts and tools
Содержание слайда: big data concepts and tools PERFORMED BY: Bondarev Petr, 433

№2 слайд
introduction The term quot
Содержание слайда: introduction The term "Big Data" has launched a veritable industry of processes, personnel and technology to support what appears to be an exploding new field. Giant companies like Amazon and Wal-Mart as well as bodies such as the U.S. government and NASA are using Big Data to meet their business and/or strategic objectives. Big Data can also play a role for small or medium-sized companies and organizations that recognize the possibilities (which can be incredibly diverse) to capitalize upon the gains.

№3 слайд
Содержание слайда:

№4 слайд
Why Are Big Data Systems
Содержание слайда: Why Are Big Data Systems Different? An exact definition of "big data" is difficult to nail down because projects, vendors, practitioners, and business professionals use it quite differently. With that in mind, generally speaking, big data is: large datasets the category of computing strategies and technologies that are used to handle large datasets

№5 слайд
Why Are Big Data Systems
Содержание слайда: Why Are Big Data Systems Different? The basic requirements for working with big data are the same as the requirements for working with datasets of any size. However, the massive scale, the speed of ingesting and processing, and the characteristics of the data that must be dealt with at each stage of the process present significant new challenges when designing solutions. The goal of most big data systems is to surface insights and connections from large volumes of heterogeneous data that would not be possible using conventional methods. In 2001, Gartner's Doug Laney first presented what became known as the "three Vs of big data" to describe some of the characteristics that make big data different from other data processing:

№6 слайд
Содержание слайда:

№7 слайд
Other Characteristics
Содержание слайда: Other Characteristics Veracity: The variety of sources and the complexity of the processing can lead to challenges in evaluating the quality of the data (and consequently, the quality of the resulting analysis) Variability: Variation in the data leads to wide variation in quality. Additional resources may be needed to identify, process, or filter low quality data to make it more useful. Value: The ultimate challenge of big data is delivering value. Sometimes, the systems and processes in place are complex enough that using the data and extracting actual value can become difficult.

№8 слайд
Содержание слайда:

№9 слайд
tools There are thousands of
Содержание слайда: tools There are thousands of Big Data tools out there for data analysis today. Data analysis is the process of inspecting, cleaning, transforming, and modeling data with the goal of discovering useful information, suggesting conclusions, and supporting decision making.

№10 слайд
Great product from Apache
Содержание слайда: Great product from Apache that has been used by many large corporations. Among the most important features of this advanced software library is superior processing of voluminous data sets in clusters of computers using effective programming models. Corporations choose Hadoop because of its great processing capabilities plus developer provides regular updates and improvements to the product. Great product from Apache that has been used by many large corporations. Among the most important features of this advanced software library is superior processing of voluminous data sets in clusters of computers using effective programming models. Corporations choose Hadoop because of its great processing capabilities plus developer provides regular updates and improvements to the product.

№11 слайд
This tool is widely used
Содержание слайда: This tool is widely used today because it provides an effective management of large amounts of data. It is a database that offers high availability and scalability without compromising the performance of commodity hardware and cloud infrastructure. Among the main advantages of Cassandra highlighted by the development are fault tolerance, performance, decentralization, professional support, durability, elasticity, and scalability. Indeed, such users of Cassandra as eBay and Netflix may prove them. This tool is widely used today because it provides an effective management of large amounts of data. It is a database that offers high availability and scalability without compromising the performance of commodity hardware and cloud infrastructure. Among the main advantages of Cassandra highlighted by the development are fault tolerance, performance, decentralization, professional support, durability, elasticity, and scalability. Indeed, such users of Cassandra as eBay and Netflix may prove them.

№12 слайд
This tool makes the list
Содержание слайда: This tool makes the list because of its superior streaming data processing capabilities in real time. It also integrates with many other tools such as Apache Slider to manage and secure the data. The use cases of Storm include data monetization, real time customer management, cyber security analytics, operational dashboards, and threat detection. These functions provide awesome business opportunities. This tool makes the list because of its superior streaming data processing capabilities in real time. It also integrates with many other tools such as Apache Slider to manage and secure the data. The use cases of Storm include data monetization, real time customer management, cyber security analytics, operational dashboards, and threat detection. These functions provide awesome business opportunities.

№13 слайд
The HPCC platform combines a
Содержание слайда: The HPCC platform combines a range of big data analysis tools. It is a package solution with tools for data profiling, cleansing, job scheduling and automation. Like Hadoop, it also leverages commodity computing clusters to provide high-performance, parallel data processing for big data applications. The HPCC platform combines a range of big data analysis tools. It is a package solution with tools for data profiling, cleansing, job scheduling and automation. Like Hadoop, it also leverages commodity computing clusters to provide high-performance, parallel data processing for big data applications. It uses ECL (a language specially designed to work with big data) as the scripting language for ETL engine. The HPCC platform supports both parallel batch data processing (Thor) and real-time query applications using indexed data files (Roxie).

№14 слайд
Elasticsearch is a dependable
Содержание слайда: Elasticsearch is a dependable and safe open source platform where you can take any data from any source, in any format and search, analyze it and envision it in real time. Elasticsearch is designed for horizontal scalability, reliability, and ease of management.  All of this achieved while combining the speed of search with the potential of analytics. It is based on Lucene a retrieval software library originally compiled in Java. It uses a developer-friendly, JSON-style, query language that works well for structured, unstructured and time-series data. Elasticsearch is a dependable and safe open source platform where you can take any data from any source, in any format and search, analyze it and envision it in real time. Elasticsearch is designed for horizontal scalability, reliability, and ease of management.  All of this achieved while combining the speed of search with the potential of analytics. It is based on Lucene a retrieval software library originally compiled in Java. It uses a developer-friendly, JSON-style, query language that works well for structured, unstructured and time-series data.

№15 слайд
Thanks for your attention!
Содержание слайда: Thanks for your attention!

№16 слайд
Sources https
Содержание слайда: Sources https://www.digitalocean.com/community/tutorials/an-introduction-to-big-data-concepts-and-terminology https://www.techrepublic.com/blog/big-data-analytics/big-data-basic-concepts-and-benefits-explained/ https://bluewavebuzzblog.wordpress.com/2014/03/20/what-is-big-data-and-how-can-it-help-marketers/ http://bigdata-madesimple.com/top-10-tools-for-working-with-big-data-for-successful-analytics-developers-2/ https://www.newgenapps.com/blog/top-best-open-source-big-data-tools

Скачать все slide презентации Big data concepts and tools одним архивом: