1 |
Course Title: |
BIG DATA |
2 |
Course Code: |
IYS4213 |
3 |
Type of Course: |
Optional |
4 |
Level of Course: |
First Cycle |
5 |
Year of Study: |
4 |
6 |
Semester: |
7 |
7 |
ECTS Credits Allocated: |
6 |
8 |
Theoretical (hour/week): |
3 |
9 |
Practice (hour/week) : |
0 |
10 |
Laboratory (hour/week) : |
0 |
11 |
Prerequisites: |
|
12 |
Recommended optional programme components: |
None |
13 |
Language: |
Turkish |
14 |
Mode of Delivery: |
Face to face |
15 |
Course Coordinator: |
Doç. Dr. MELİH ENGİN |
16 |
Course Lecturers: |
Doç.Dr. Melih ENGİN |
17 |
Contactinformation of the Course Coordinator: |
Doç.Dr. Melih ENGİN 0224 294 26 95 melihengin@uludag.edu.tr |
18 |
Website: |
|
19 |
Objective of the Course: |
This course covers analytical data generation, storage, management, transfer, in-depth analysis of incoming big data, covering the use of existing technologies, tools, architectures and systems in Big Data. Data processing solutions in high performance networks that provide a coverage area. It examines the bigdata applications emerging in various fields and tests the widely used big data applications, including application and development. It will also focus on data mining and machine learning algorithms to analyze big data. |
20 |
Contribution of the Course to Professional Development |
To be able to design the systems necessary for an enterprise and to produce solutions for the needs of iders.
|
21 |
Learning Outcomes: |
1 |
"Theory" and "practice" are taken in a balanced way to teach the student to understand, use and practice analysis of big data analysis and management. The course is designed to provide the student with an understanding of the issues and problems involved in large on-line warehouse systems, it plans to provide a basis for the solutions of today's practical techniques, knowledge and research approaches of tomorrow to meet the needs of such a system.;
|
2 |
They will learn big data concepts, terminology, data analytics features, big data types such as 5V-structural-non-structural-metadata.;
|
3 |
Understands analysis techniques such as qualitative - quantitative data mining, statistical analysis, A / B testing, correlation, regression analysis.;
|
4 |
They will be able to master storage concepts such as clustering, distributed file systems, relational database systems, NoSQL, in-memory storage and big data processing concepts such as parallel, distributed, and mass data processing. ;
|
5 |
They comprehend parallel processing and other design patterns such as Cloudera virtual machine, HDFS (Hadoop Distributed File System), YARN (Yet Another Resource Negotiator and Hue) in processing big data.;
|
|
Week |
Theoretical |
Practical |
1 |
Introduction to Big Data: Covers concepts, terminology, features, and big Data types such as 5V, structured, unstructured, semi-structured and metadata. It covers business and research motivations. |
|
2 |
Storage and Analysis in Big Data: It covers storage concepts such as clusters, distributed file systems, RDBMS, NoSQL, sharding, memory storage, as well as big data processing concepts such as parallel, distributed, mass data processing and Hadoop. |
|
3 |
Big Data Analysis Techniques: It covers analysis techniques and topics such as quantitative, qualitative data mining, statistical analysis, Machine Learning, semantic analysis and visual data analysis. |
|
4 |
MapReduce Framework and Hadoop: Covers parallel processes and other design patterns for big data processing. Cloudera virtual machine. HDFS, YARN and Hue. |
|
5 |
Mapreduce API and Basic Programming with Java: We will examine the Hadoop MapReduce Java API. |
|
6 |
Hive usage: "data warehouse" HDFS and Hadoop. SQL queries on stored data. |
|
7 |
Using Spark: Memory-based data analysis of the MapReduce framework and RDD-Spark. |
|
8 |
Flume and Kafka training |
|
9 |
Spark data stream, Kafka and Cassandra: Standardized stack use for processing fast data. |
|
10 |
MLLib with Spark, Machine Learning: We will review learning algorithms to make predictions on data. |
|
11 |
Visualization in Big Data Clusters: We will see a graphical analysis of the content and properties of big data. |
|
12 |
Advanced topics and applications in big data. |
|
13 |
Advanced topics and applications in big data. |
|
14 |
exercise week. |
|
23 |
Textbooks, References and/or Other Materials: |
1. Big Data Fundamentals: Concepts, Drivers & Techniques (1st ed.). Thomas Erl, Wajid Khattak, and Paul Buhler. Prentice Hall Press, Upper Saddle River, NJ, USA. 2016. 2. Big Data, Principles and Best Practices of Scalable Realtime Data Systems, Nathan Marz and James Warren, Manning Publications 2015. 3. Hadoop: The Definitive Guide, Tom White, O’Reilly, 2015. |
24 |
Assesment |
|