Автор: Trevor Hastie; Robert Tibshirani; Jerome Friedman Название: The Elements of Statistical Learning ISBN: 0387848576 ISBN-13(EAN): 9780387848570 Издательство: Springer Рейтинг: Цена: 10480.00 р. Наличие на складе: Есть у поставщика Поставка под заказ.
Описание: This major new edition features many topics not covered in the original, including graphical models, random forests, and ensemble methods. As before, it covers the conceptual framework for statistical data in our rapidly expanding computerized world.
Описание: Leverage Apache Spark within a modern data engineering ecosystem. This hands-on guide will teach you how to write fully functional applications, follow industry best practices, and learn the rationale behind these decisions. With Apache Spark as the foundation, you will follow a step-by-step journey beginning with the basics of data ingestion, processing, and transformation, and ending up with an entire local data platform running Apache Spark, Apache Zeppelin, Apache Kafka, Redis, MySQL, Minio (S3), and Apache Airflow. Apache Spark applications solve a wide range of data problems from traditional data loading and processing to rich SQL-based analysis as well as complex machine learning workloads and even near real-time processing of streaming data. Spark fits well as a central foundation for any data engineering workload. This book will teach you to write interactive Spark applications using Apache Zeppelin notebooks, write and compile reusable applications and modules, and fully test both batch and streaming. You will also learn to containerize your applications using Docker and run and deploy your Spark applications using a variety of tools such as Apache Airflow, Docker and Kubernetes. Reading this book will empower you to take advantage of Apache Spark to optimize your data pipelines and teach you to craft modular and testable Spark applications. You will create and deploy mission-critical streaming spark applications in a low-stress environment that paves the way for your own path to production. What You Will Learn * Simplify data transformation with Spark Pipelines and Spark SQL * Bridge data engineering with machine learning * Architect modular data pipeline applications * Build reusable application components and libraries * Containerize your Spark applications for consistency and reliability * Use Docker and Kubernetes to deploy your Spark applications * Speed up application experimentation using Apache Zeppelin and Docker * Understand serializable structured data and data contracts * Harness effective strategies for optimizing data in your data lakes * Build end-to-end Spark structured streaming applications using Redis and Apache Kafka * Embrace testing for your batch and streaming applications * Deploy and monitor your Spark applications Who This Book Is For Professional software engineers who want to take their current skills and apply them to new and exciting opportunities within the data ecosystem, practicing data engineers who are looking for a guiding light while traversing the many challenges of moving from batch to streaming modes, data architects who wish to provide clear and concise direction for how best to harness and use Apache Spark within their organization, and those interested in the ins and outs of becoming a modern data engineer in today's fast-paced and data-hungry world
Автор: Maosong Sun, Xiaojie Wang, Baobao Chang Название: Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data ISBN: 3319690043 ISBN-13(EAN): 9783319690049 Издательство: Springer Рейтинг: Цена: 5300.00 р. Наличие на складе: Есть (3 шт.) Описание: This book constitutes the proceedings of the 16th China National Conference on Computational Linguistics, CCL 2017, and the 5th International Symposium on Natural Language Processing Based on Naturally Annotated Big Data, NLP-NABD 2017, held in Nanjing, China, in October 2017. Minority language information processing.
Автор: Foster Provost Название: Data Science For Business: What You Need To Know About Data Mining And Dataanalytic Thinking ISBN: 1449361323 ISBN-13(EAN): 9781449361327 Издательство: Wiley Рейтинг: Цена: 6334.00 р. Наличие на складе: Есть (1 шт.) Описание: This broad, deep, but not-too-technical guide introduces you to the fundamental principles of data science and walks you through the "data-analytic thinking" necessary for extracting useful knowledge and business value from the data you collect.
Автор: Connie Clare, Maria Cruz Название: Engaging Researchers with Data Management ISBN: 1783747986 ISBN-13(EAN): 9781783747986 Издательство: Неизвестно Рейтинг: Цена: 9003.00 р. Наличие на складе: Есть у поставщика Поставка под заказ.
Описание: This handbook is a collaboration by research institutions, for research institutions. It aims not only to inspire and engage, but also to help drive cultural change towards better data management. It has been written for anyone interested in RDM, or simply, good research practice.
Автор: Ahmed Elngar, Ambika Pawar, Prathamesh Churi Название: Data Protection and Privacy in Healthcare ISBN: 0367501082 ISBN-13(EAN): 9780367501082 Издательство: Taylor&Francis Рейтинг: Цена: 22202.00 р. Наличие на складе: Есть у поставщика Поставка под заказ.
Описание: Healthcare management has rapidly changed over the last few years from disease centered to patient centered. The analysis of healthcare data plays an important role in healthcare management, but the privacy of patient`s records must be of equal concern. This book covers the privacy issues and data protection laws in healthcare industries.
Автор: Ashok N. Srivastava, Mehran Sahami Название: Text Mining ISBN: 1420059408 ISBN-13(EAN): 9781420059403 Издательство: Taylor&Francis Рейтинг: Цена: 15004.00 р. Наличие на складе: Есть у поставщика Поставка под заказ.
Описание: Focuses on statistical methods for text mining and analysis. This work examines methods to automatically cluster and classify text documents and applies these methods in a variety of areas, including adaptive information filtering, information distillation, and text search.
Описание: Data science has a huge impact on how companies conduct business, and those who don`t learn about this revolutionaryfield could be left behind. You see, data science will help you make better decisions, know what products and services to release, and how to provide better service to your customers.
Автор: Murugan Anandarajan; Chelsey Hill; Thomas Nolan Название: Practical Text Analytics ISBN: 3319956620 ISBN-13(EAN): 9783319956626 Издательство: Springer Рейтинг: Цена: 9781.00 р. Наличие на складе: Есть у поставщика Поставка под заказ.
Описание: This book introduces text analytics as a valuable method for deriving insights from text data. Unlike other text analytics publications, Practical Text Analytics: Maximizing the Value of Text Data makes technical concepts accessible to those without extensive experience in the field. Using text analytics, organizations can derive insights from content such as emails, documents, and social media. Practical Text Analytics is divided into five parts. The first part introduces text analytics, discusses the relationship with content analysis, and provides a general overview of text mining methodology. In the second part, the authors discuss the practice of text analytics, including data preparation and the overall planning process. The third part covers text analytics techniques such as cluster analysis, topic models, and machine learning. In the fourth part of the book, readers learn about techniques used to communicate insights from text analysis, including data storytelling. The final part of Practical Text Analytics offers examples of the application of software programs for text analytics, enabling readers to mine their own text data to uncover information.
Автор: Kumar, Raghvendra, Sharma, Rohit, Pattnaik, Prasan Название: Multimedia Technologies in the Internet of Things Environment ISBN: 9811579644 ISBN-13(EAN): 9789811579646 Издательство: Springer Рейтинг: Цена: 25155.00 р. Наличие на складе: Есть у поставщика Поставка под заказ.
Описание: It bridges the gap between multimedia concepts and solutions by providing the current IOT frameworks, their applications in multimedia analysis, the strengths and limitations of the existing methods, and the future directions in multimedia IOT analytics.
Автор: Turab Lookman Название: Materials Discovery and Design ISBN: 3319994646 ISBN-13(EAN): 9783319994642 Издательство: Springer Рейтинг: Цена: 23757.00 р. Наличие на складе: Есть у поставщика Поставка под заказ.
Описание: This book addresses the current status, challenges and future directions of data-driven materials discovery and design. It presents the analysis and learning from data as a key theme in many science and cyber related applications. The challenging open questions as well as future directions in the application of data science to materials problems are sketched. Computational and experimental facilities today generate vast amounts of data at an unprecedented rate. The book gives guidance to discover new knowledge that enables materials innovation to address grand challenges in energy, environment and security, the clearer link needed between the data from these facilities and the theory and underlying science. The role of inference and optimization methods in distilling the data and constraining predictions using insights and results from theory is key to achieving the desired goals of real time analysis and feedback. Thus, the importance of this book lies in emphasizing that the full value of knowledge driven discovery using data can only be realized by integrating statistical and information sciences with materials science, which is increasingly dependent on high throughput and large scale computational and experimental data gathering efforts. This is especially the case as we enter a new era of big data in materials science with the planning of future experimental facilities such as the Linac Coherent Light Source at Stanford (LCLS-II), the European X-ray Free Electron Laser (EXFEL) and MaRIE (Matter Radiation in Extremes), the signature concept facility from Los Alamos National Laboratory. These facilities are expected to generate hundreds of terabytes to several petabytes of in situ spatially and temporally resolved data per sample. The questions that then arise include how we can learn from the data to accelerate the processing and analysis of reconstructed microstructure, rapidly map spatially resolved properties from high throughput data, devise diagnostics for pattern detection, and guide experiments towards desired targeted properties. The authors are an interdisciplinary group of leading experts who bring the excitement of the nascent and rapidly emerging field of materials informatics to the reader.
Автор: Ivezic?, Z?eljko, Название: Statistics, data mining, and machine learning in astronomy : ISBN: 0691198306 ISBN-13(EAN): 9780691198309 Издательство: Wiley Рейтинг: Цена: 12989.00 р. Наличие на складе: Есть у поставщика Поставка под заказ.
Описание:
Statistics, Data Mining, and Machine Learning in Astronomy is the essential introduction to the statistical methods needed to analyze complex data sets from astronomical surveys such as the Panoramic Survey Telescope and Rapid Response System, the Dark Energy Survey, and the Large Synoptic Survey Telescope. Now fully updated, it presents a wealth of practical analysis problems, evaluates the techniques for solving them, and explains how to use various approaches for different types and sizes of data sets. Python code and sample data sets are provided for all applications described in the book. The supporting data sets have been carefully selected from contemporary astronomical surveys and are easy to download and use. The accompanying Python code is publicly available, well documented, and follows uniform coding standards. Together, the data sets and code enable readers to reproduce all the figures and examples, engage with the different methods, and adapt them to their own fields of interest.
An accessible textbook for students and an indispensable reference for researchers, this updated edition features new sections on deep learning methods, hierarchical Bayes modeling, and approximate Bayesian computation. The chapters have been revised throughout and the astroML code has been brought completely up to date.
Fully revised and expanded
Describes the most useful statistical and data-mining methods for extracting knowledge from huge and complex astronomical data sets
Features real-world data sets from astronomical surveys
Uses a freely available Python codebase throughout
Ideal for graduate students, advanced undergraduates, and working astronomers
ООО "Логосфера " Тел:+7(495) 980-12-10 www.logobook.ru