Data Science in Theory and Practice: Techniques for Big Data Analytics and Complex Data Sets, Mariani Maria C., Tweneboah Osei Kofi, Beccar-Varela Maria Pia
Описание: Detect fraud earlier to mitigate loss and prevent cascading damage Fraud Analytics Using Descriptive, Predictive, and Social Network Techniques is an authoritative guidebook for setting up a comprehensive fraud detection analytics solution.
As data holdings get bigger and questions get harder, data scientists and analysts must focus on the systems, the tools and techniques, and the disciplined process to get the correct answer, quickly Whether you work within industry or government, this book will provide you with a foundation to successfully and confidently process large amounts of quantitative data.
Here are just a dozen of the many questions answered within these pages:
What does quantitative analysis of a system really mean?
What is a system?
What are big data and analystics?
How do you know your numbers are good?
What will the future data science environment look like?
How do you determine data provenance?
How do you gather and process information, and then organize, store, and synthesize it?
How does an organization implement data analytics?
Do you really need to think like a Chief Information Officer?
What is the best way to protect data?
What makes a good dashboard?
What is the relationship between eating ice cream and getting attacked by a shark?
The nine chapters in this book are arranged in three parts that address systems concepts in general, tools and techniques, and future trend topics. Systems concepts include contrasting open and closed systems, performing data mining and big data analysis, and gauging data quality. Tools and techniques include analyzing both continuous and discrete data, applying probability basics, and practicing quantitative analysis such as descriptive and inferential statistics. Future trends include leveraging the Internet of Everything, modeling Artificial Intelligence, and establishing a Data Analytics Support Office (DASO).
Many examples are included that were generated using common software, such as Excel, Minitab, Tableau, SAS, and Crystal Ball. While words are good, examples can sometimes be a better teaching tool. For each example included, data files can be found on the companion website. Many of the data sets are tied to the global economy because they use data from shipping ports, air freight hubs, largest cities, and soccer teams. The appendices contain more detailed analysis including the 10 T's for Data Mining, Million Row Data Audit (MRDA) Processes, Analysis of Rainfall, and Simulation Models for Evaluating Traffic Flow.
Автор: Simon Walkowiak Название: Big Data Analytics with R ISBN: 1786466457 ISBN-13(EAN): 9781786466457 Издательство: Неизвестно Рейтинг: Цена: 11217.00 р. Наличие на складе: Есть у поставщика Поставка под заказ.
Описание: Utilize R to uncover hidden patterns in your Big Data About This Book Perform computational analyses on Big Data to generate meaningful results Get a practical knowledge of R programming language while working on Big Data platforms like Hadoop, Spark, H2O and SQL/NoSQL databases, Explore fast, streaming, and scalable data analysis with the most cutting-edge technologies in the market Who This Book Is For This book is intended for Data Analysts, Scientists, Data Engineers, Statisticians, Researchers, who want to integrate R with their current or future Big Data workflows. It is assumed that readers have some experience in data analysis and understanding of data management and algorithmic processing of large quantities of data, however they may lack specific skills related to R. What You Will Learn Learn about current state of Big Data processing using R programming language and its powerful statistical capabilities Deploy Big Data analytics platforms with selected Big Data tools supported by R in a cost-effective and time-saving manner Apply the R language to real-world Big Data problems on a multi-node Hadoop cluster, e.g. electricity consumption across various socio-demographic indicators and bike share scheme usage Explore the compatibility of R with Hadoop, Spark, SQL and NoSQL databases, and H2O platform In Detail Big Data analytics is the process of examining large and complex data sets that often exceed the computational capabilities. R is a leading programming language of data science, consisting of powerful functions to tackle all problems related to Big Data processing. The book will begin with a brief introduction to the Big Data world and its current industry standards. With introduction to the R language and presenting its development, structure, applications in real world, and its shortcomings. Book will progress towards revision of major R functions for data management and transformations. Readers will be introduce to Cloud based Big Data solutions (e.g. Amazon EC2 instances and Amazon RDS, Microsoft Azure and its HDInsight clusters) and also provide guidance on R connectivity with relational and non-relational databases such as MongoDB and HBase etc. It will further expand to include Big Data tools such as Apache Hadoop ecosystem, HDFS and MapReduce frameworks. Also other R compatible tools such as Apache Spark, its machine learning library Spark MLlib, as well as H2O. Style and approach This book will serve as a practical guide to tackling Big Data problems using R programming language and its statistical environment. Each section of the book will present you with concise and easy-to-follow steps on how to process, transform and analyse large data sets."
Автор: Wong Название: Big Data Analytics in Genomics ISBN: 3319412787 ISBN-13(EAN): 9783319412788 Издательство: Springer Рейтинг: Цена: 23757.00 р. Наличие на складе: Есть у поставщика Поставка под заказ.
Описание:
This contributed volume explores the emerging intersection between big data analytics and genomics. Recent sequencing technologies have enabled high-throughput sequencing data generation for genomics resulting in several international projects which have led to massive genomic data accumulation at an unprecedented pace. To reveal novel genomic insights from this data within a reasonable time frame, traditional data analysis methods may not be sufficient or scalable, forcing the need for big data analytics to be developed for genomics. The computational methods addressed in the book are intended to tackle crucial biological questions using big data, and are appropriate for either newcomers or veterans in the field.
This volume offers thirteen peer-reviewed contributions, written by international leading experts from different regions, representing Argentina, Brazil, China, France, Germany, Hong Kong, India, Japan, Spain, and the USA. In particular, the book surveys three main areas: statistical analytics, computational analytics, and cancer genome analytics. Sample topics covered include: statistical methods for integrative analysis of genomic data, computation methods for protein function prediction, and perspectives on machine learning techniques in big data mining of cancer. Self-contained and suitable for graduate students, this book is also designed for bioinformaticians, computational biologists, and researchers in communities ranging from genomics, big data, molecular genetics, data mining, biostatistics, biomedical science, cancer research, medical research, and biology to machine learning and computer science. Readers will find this volume to be an essential read for appreciating the role of big data in genomics, making this an invaluable resource for stimulating further research on the topic.
Автор: Pyne Название: Big Data Analytics ISBN: 8132236262 ISBN-13(EAN): 9788132236269 Издательство: Springer Рейтинг: Цена: 15372.00 р. Наличие на складе: Есть у поставщика Поставка под заказ.
Описание: This book has a collection of articles written by Big Data experts to describe some of the cutting-edge methods and applications from their respective areas of interest, and provides the reader with a detailed overview of the field of Big Data Analytics as it is practiced today. The chapters cover technical aspects of key areas that generate and use Big Data such as management and finance; medicine and healthcare; genome, cytome and microbiome; graphs and networks; Internet of Things; Big Data standards; bench-marking of systems; and others. In addition to different applications, key algorithmic approaches such as graph partitioning, clustering and finite mixture modelling of high-dimensional data are also covered. The varied collection of themes in this volume introduces the reader to the richness of the emerging field of Big Data Analytics.
Data Mining: Practical Machine Learning Tools and Techniques, Fourth Edition, offers a thorough grounding in machine learning concepts, along with practical advice on applying these tools and techniques in real-world data mining situations. This highly anticipated fourth edition of the most acclaimed work on data mining and machine learning teaches readers everything they need to know to get going, from preparing inputs, interpreting outputs, evaluating results, to the algorithmic methods at the heart of successful data mining approaches.
Extensive updates reflect the technical changes and modernizations that have taken place in the field since the last edition, including substantial new chapters on probabilistic methods and on deep learning. Accompanying the book is a new version of the popular WEKA machine learning software from the University of Waikato. Authors Witten, Frank, Hall, and Pal include today's techniques coupled with the methods at the leading edge of contemporary research.
Please visit the book companion website at https: //www.cs.waikato.ac.nz/ ml/weka/book.html.
It contains
Powerpoint slides for Chapters 1-12. This is a very comprehensive teaching resource, with many PPT slides covering each chapter of the book
Online Appendix on the Weka workbench; again a very comprehensive learning aid for the open source software that goes with the book
Table of contents, highlighting the many new sections in the 4th edition, along with reviews of the 1st edition, errata, etc.
Provides a thorough grounding in machine learning concepts, as well as practical advice on applying the tools and techniques to data mining projects
Presents concrete tips and techniques for performance improvement that work by transforming the input or output in machine learning methods
Includes a downloadable WEKA software toolkit, a comprehensive collection of machine learning algorithms for data mining tasks-in an easy-to-use interactive interface
Includes open-access online courses that introduce practical applications of the material in the book
Описание: Data Mining for Business Analytics: Concepts, Techniques, and Applications in XLMiner(R), Third Edition presents an applied approach to data mining and predictive analytics with clear exposition, hands-on exercises, and real-life case studies.
If you are a student or a professional looking for more technical skills, or if you are simply curious about the most up-to-date data analysis techniques and their powerful applications, then this is definitely the book for you.
Learning all of the required skills to master data science and machine learning could certainly be challenging, but in this book, Jason Callaway has condensed all of the knowledge you need into a clear and beginner-friendly introduction, with practical examples, detailed explanations, and tips and tricks from his experience.
Through his revolutionary and systematic approach, you can learn techniques to manipulate and process datasets, the principles of Python programming, and their real-world applications, regardless of your previous experience.
Here's just a tiny fraction of what you will discover:
What data science is, and why it has become fundamental in hundreds of business and technological applications
The basics of Python programming
Essential Python libraries such as NumPy, Pandas, and Matplotlib
All of the most effective computational methods for data analysis
Data visualization tools and techniques
How to build statistical and machine learning models (even if you are brand new to programming)
The future of Artificial Intelligence
How to build neural networks with Python
Step-by-step exercises, practical examples, and tips and tricks
Are you ready to develop a successful career in the growing industry of data science?
Описание: This book starts with an introduction to process modeling and process paradigms, then explains how to query and analyze process models, and how to analyze the process execution data.
Автор: Gunter Wallner Название: Data Analytics Applications in Gaming and Entertainment ISBN: 1138104434 ISBN-13(EAN): 9781138104433 Издательство: Taylor&Francis Рейтинг: Цена: 16078.00 р. Наличие на складе: Поставка под заказ.
Описание: Over the last decade big data and data mining has received growing interest and importance in game production to process and draw actionable insights from large volumes of player-related data in order to inform game design, to ensure customer satisfaction, to maximize revenues, and to drive technical innovation.
Автор: B. S.P. Mishra; Satchidananda Dehuri; Euiwhan Kim; Название: Techniques and Environments for Big Data Analysis ISBN: 3319275186 ISBN-13(EAN): 9783319275185 Издательство: Springer Рейтинг: Цена: 16979.00 р. Наличие на складе: Есть у поставщика Поставка под заказ.
Описание: This volume is aiming at a wide range of readers andresearchers in the area of Big Data by presenting the recent advances in the fieldsof Big Data Analysis, as well as the techniques and tools used to analyze it.
Описание: This book describes analytical techniques for optimizing knowledge acquisition, processing, and propagation, especially in the contexts of cyber-infrastructure and big data.
ООО "Логосфера " Тел:+7(495) 980-12-10 www.logobook.ru