Big Data Analytics For MU Semester 8 Information Technology (Code : ITDO8011) Academic Year 2022-2023

Big Data Analytics For MU Semester 8 Information Technology (Code : ITDO8011) Academic Year 2022-2023  (Paperback, Dr. Sangeeta Vhatkar, Prof. Dipali K. Pawar, Prof. Rupali D. Pashte, Dr. Zahir Aalm)

Price: Not Available
Currently Unavailable
Highlights
  • Author: Dr. Sangeeta Vhatkar, Prof. Dipali K. Pawar, Prof. Rupali D. Pashte, Dr. Zahir Aalm
  • 210 Pages
  • Language: English
  • Publisher: Tech-Neo Publications
Description
0 Prerequisite Data Mining, Data Science 02 I Introduction to Big Data Introduction to Big Data, Big Data characteristics, types of Big Data, Traditional vs. Big Data business approach, Big Data Challenges, Examples of Big Data in Real Life, Big Data Applications Self-learning Topics: Identification of Big Data applications and its solutions. (Refer chapter 1) 03 CO1 II Introduction to Big Data Frameworks What is Hadoop? Core Hadoop Components; Hadoop Ecosystem; Working with Apache Spark What is NoSQL? NoSQL data architecture patterns: Key- value stores, Graph stores, Column family (Bigtable) stores, Document stores, MongoDB Self-learning Topics: HDFS vs GFS, MongoDB vs other NoSQL system, Implementation of Apache Spark. (Refer chapter 2) 06 CO2 III MapReduce Paradigm MapReduce: The Map Tasks, Grouping by Key, The Reduce Tasks, Combiners, Details of MapReduce Execution, Coping With Node Failures. Algorithms Using MapReduce: Matrix- Vector Multiplication by MapReduce , Relational-Algebra Operations, Computing Selections by MapReduce, Computing Projections by MapReduce, Union, Intersection, and Difference by MapReduce, Computing Natural Join by MapReduce, Grouping and Aggregation by MapReduce, Matrix Multiplication, Matrix Multiplication with One MapReduce Step . Illustrating use of MapReduce with use of real life databases and applications. Self-learning Topics : Implementation of MapReduce algorithms like Word count, Matrix-Vector and Matrix- Matrix algorithm. (Refer chapter 3) 07 CO3 IV Mining Big Data Streams The Stream Data Model: A DataStream-Management System, Examples of Stream Sources, Stream Queries, Issues in Stream Processing. Sampling Data in a Stream : Sampling Techniques. Filtering Streams: The Bloom Filter Counting Distinct Elements in a Stream : The Count-Distinct Problem, The Flajolet-Martin Algorithm, Combining Estimates, Space Requirements . Counting Ones in a Window: The Cost of Exact Counts, The Datar-Gionis-Indyk, Motwani Algorithm, Query Answering in the DGIM Algorithm. Self-learning Topics : Streaming services like Apache Kafka/Amazon Kinesis/Google Cloud DataFlow. Standard spark streaming library. Integration with IOT devices to capture real time stream data. (Refer chapter 4) 07 CO4 V Big Data Mining Algorithms Frequent Pattern Mining : Handling Larger Datasets in Main Memory Basic Algorithm of Park, Chen, and Yu. The SON Algorithm and MapReduce. Clustering Algorithms: CURE Algorithm. Canopy Clustering, Clustering with MapReduce Classification Algorithms: Overview SVM classifiers, Parallel SVM, KNearest Neighbor classifications for Big Data, One Nearest Neighbour. Self-learning Topics : Standard libraries included with spark like graphX, MLlib. (Refer chapter 5) 07 CO5 VI Big Data Analytics Applications Link Analysis : PageRank Definition, Structure of the web, dead ends, Using Page rank in a search engine, Efficient computation of Page Rank: PageRank Iteration Using MapReduce, Topic sensitive Page Rank, link Spam, Hubs and Authorities, HITS Algorithm. Mining Social-Network Graphs : Social Networks as Graphs, Types , Clustering of Social Network Graphs, Direct Discovery of Communities, Counting triangles using Map- Reduce. Recommendation Engines : A Model for Recommendation Systems, Content-Based Recommendations, Collaborative Filtering Self-learning Topics : Sample applications like social media feeds, multiplayer game interactions, retail industry, financial data analysis. Use case like location data, real-time stock trades, log monitoring etc. (Refer chapter 6) 07 CO6
Read More
Specifications
Book
  • Big Data Analytics For MU Semester 8 Information Technology (Code : ITDO8011) Academic Year 2022-2023
Author
  • Dr. Sangeeta Vhatkar, Prof. Dipali K. Pawar, Prof. Rupali D. Pashte, Dr. Zahir Aalm
Binding
  • Paperback
Publishing Date
  • 2023
Publisher
  • Tech-Neo Publications
Edition
  • 1
Board
  • MU
Exam
  • MU
Standard
  • MU
Number of Pages
  • 210
Language
  • English
Subject
  • Big Data Analytics Strictly as per the New Syllabus (REV-2019 ‘C’ Scheme) of Mumbai University w.e.f. academic year 2022-2023 Semester 8 : Information Technology (Course Code : ITDO8011) (Department Optional Course – 5)
Age Group
  • 10-60
Specialization
  • Information Technology
University
  • MU
Genre
  • Academic & Test Preparation
Book Subcategory
  • Other Books
Degree/Diploma
  • MU Degree
Term/Year
  • 4 Year
Term/Semester
  • 8 Semester
Author Info
  • Prof. Dipali K. Pawar, Dr. Sangeeta Vhatkar, Prof. Rupali D. Pashte, Dr. Zahir Aalm
University/Subject
  • MU
Be the first to ask about this product
Safe and Secure Payments.Easy returns.100% Authentic products.
Back to top