Large Scale Data Analytics

This book presents a language integrated query framework for big data. The continuous, rapid growth of data information to volumes of up to terabytes (1,024 gigabytes) or petabytes (1,048,576 gigabytes) means that the need for a system to manage and query information from large scale data sources is...

Full description

Bibliographic Details
Main Authors: Cho, Chung Yik, Tan, Rong Kun Jason (Author), Leong, John A. (Author), Sidhu, Amandeep S. (Author)
Format: eBook
Language:English
Published: Cham Springer International Publishing 2019, 2019
Edition:1st ed. 2019
Series:Data, Semantics and Cloud Computing
Subjects:
Online Access:
Collection: Springer eBooks 2005- - Collection details see MPG.ReNa
LEADER 02182nmm a2200313 u 4500
001 EB001860368
003 EBX01000000000000001024464
005 00000000000000.0
007 cr|||||||||||||||||||||
008 190201 ||| eng
020 |a 9783030038922 
100 1 |a Cho, Chung Yik 
245 0 0 |a Large Scale Data Analytics  |h Elektronische Ressource  |c by Chung Yik Cho, Rong Kun Jason Tan, John A. Leong, Amandeep S. Sidhu 
250 |a 1st ed. 2019 
260 |a Cham  |b Springer International Publishing  |c 2019, 2019 
300 |a IX, 89 p  |b online resource 
505 0 |a Introduction -- Background -- Large Scale Data Analytics -- Query Framework -- Results and Discussion -- Conclusion and Future Works 
653 |a Applied mathematics 
653 |a Engineering mathematics 
653 |a Mathematical and Computational Engineering 
700 1 |a Tan, Rong Kun Jason  |e [author] 
700 1 |a Leong, John A.  |e [author] 
700 1 |a Sidhu, Amandeep S.  |e [author] 
041 0 7 |a eng  |2 ISO 639-2 
989 |b Springer  |a Springer eBooks 2005- 
490 0 |a Data, Semantics and Cloud Computing 
856 4 0 |u https://doi.org/10.1007/978-3-030-03892-2?nosfx=y  |x Verlag  |3 Volltext 
082 0 |a 519 
520 |a This book presents a language integrated query framework for big data. The continuous, rapid growth of data information to volumes of up to terabytes (1,024 gigabytes) or petabytes (1,048,576 gigabytes) means that the need for a system to manage and query information from large scale data sources is becoming more urgent. Currently available frameworks and methodologies are limited in terms of efficiency and querying compatibility between data sources due to the differences in information storage structures. For this research, the authors designed and programmed a framework based on the fundamentals of language integrated query to query existing data sources without the process of data restructuring. A web portal for the framework was also built to enable users to query protein data from the Protein Data Bank (PDB) and implement it on Microsoft Azure, a cloud computing environment known for its reliability, vast computing resources and cost-effectiveness