Автор: Trevor Hastie; Robert Tibshirani; Jerome Friedman Название: The Elements of Statistical Learning ISBN: 0387848576 ISBN-13(EAN): 9780387848570 Издательство: Springer Рейтинг: Цена: 10480.00 р. Наличие на складе: Есть у поставщика Поставка под заказ.
Описание: This major new edition features many topics not covered in the original, including graphical models, random forests, and ensemble methods. As before, it covers the conceptual framework for statistical data in our rapidly expanding computerized world.
Описание: Leverage Apache Spark within a modern data engineering ecosystem. This hands-on guide will teach you how to write fully functional applications, follow industry best practices, and learn the rationale behind these decisions. With Apache Spark as the foundation, you will follow a step-by-step journey beginning with the basics of data ingestion, processing, and transformation, and ending up with an entire local data platform running Apache Spark, Apache Zeppelin, Apache Kafka, Redis, MySQL, Minio (S3), and Apache Airflow. Apache Spark applications solve a wide range of data problems from traditional data loading and processing to rich SQL-based analysis as well as complex machine learning workloads and even near real-time processing of streaming data. Spark fits well as a central foundation for any data engineering workload. This book will teach you to write interactive Spark applications using Apache Zeppelin notebooks, write and compile reusable applications and modules, and fully test both batch and streaming. You will also learn to containerize your applications using Docker and run and deploy your Spark applications using a variety of tools such as Apache Airflow, Docker and Kubernetes. Reading this book will empower you to take advantage of Apache Spark to optimize your data pipelines and teach you to craft modular and testable Spark applications. You will create and deploy mission-critical streaming spark applications in a low-stress environment that paves the way for your own path to production. What You Will Learn * Simplify data transformation with Spark Pipelines and Spark SQL * Bridge data engineering with machine learning * Architect modular data pipeline applications * Build reusable application components and libraries * Containerize your Spark applications for consistency and reliability * Use Docker and Kubernetes to deploy your Spark applications * Speed up application experimentation using Apache Zeppelin and Docker * Understand serializable structured data and data contracts * Harness effective strategies for optimizing data in your data lakes * Build end-to-end Spark structured streaming applications using Redis and Apache Kafka * Embrace testing for your batch and streaming applications * Deploy and monitor your Spark applications Who This Book Is For Professional software engineers who want to take their current skills and apply them to new and exciting opportunities within the data ecosystem, practicing data engineers who are looking for a guiding light while traversing the many challenges of moving from batch to streaming modes, data architects who wish to provide clear and concise direction for how best to harness and use Apache Spark within their organization, and those interested in the ins and outs of becoming a modern data engineer in today's fast-paced and data-hungry world
Автор: Maosong Sun, Xiaojie Wang, Baobao Chang Название: Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data ISBN: 3319690043 ISBN-13(EAN): 9783319690049 Издательство: Springer Рейтинг: Цена: 5300.00 р. Наличие на складе: Есть (3 шт.) Описание: This book constitutes the proceedings of the 16th China National Conference on Computational Linguistics, CCL 2017, and the 5th International Symposium on Natural Language Processing Based on Naturally Annotated Big Data, NLP-NABD 2017, held in Nanjing, China, in October 2017. Minority language information processing.
Автор: Foster Provost Название: Data Science For Business: What You Need To Know About Data Mining And Dataanalytic Thinking ISBN: 1449361323 ISBN-13(EAN): 9781449361327 Издательство: Wiley Рейтинг: Цена: 6334.00 р. Наличие на складе: Есть (1 шт.) Описание: This broad, deep, but not-too-technical guide introduces you to the fundamental principles of data science and walks you through the "data-analytic thinking" necessary for extracting useful knowledge and business value from the data you collect.
Автор: Turab Lookman Название: Materials Discovery and Design ISBN: 3319994646 ISBN-13(EAN): 9783319994642 Издательство: Springer Рейтинг: Цена: 23757.00 р. Наличие на складе: Есть у поставщика Поставка под заказ.
Описание: This book addresses the current status, challenges and future directions of data-driven materials discovery and design. It presents the analysis and learning from data as a key theme in many science and cyber related applications. The challenging open questions as well as future directions in the application of data science to materials problems are sketched. Computational and experimental facilities today generate vast amounts of data at an unprecedented rate. The book gives guidance to discover new knowledge that enables materials innovation to address grand challenges in energy, environment and security, the clearer link needed between the data from these facilities and the theory and underlying science. The role of inference and optimization methods in distilling the data and constraining predictions using insights and results from theory is key to achieving the desired goals of real time analysis and feedback. Thus, the importance of this book lies in emphasizing that the full value of knowledge driven discovery using data can only be realized by integrating statistical and information sciences with materials science, which is increasingly dependent on high throughput and large scale computational and experimental data gathering efforts. This is especially the case as we enter a new era of big data in materials science with the planning of future experimental facilities such as the Linac Coherent Light Source at Stanford (LCLS-II), the European X-ray Free Electron Laser (EXFEL) and MaRIE (Matter Radiation in Extremes), the signature concept facility from Los Alamos National Laboratory. These facilities are expected to generate hundreds of terabytes to several petabytes of in situ spatially and temporally resolved data per sample. The questions that then arise include how we can learn from the data to accelerate the processing and analysis of reconstructed microstructure, rapidly map spatially resolved properties from high throughput data, devise diagnostics for pattern detection, and guide experiments towards desired targeted properties. The authors are an interdisciplinary group of leading experts who bring the excitement of the nascent and rapidly emerging field of materials informatics to the reader.
Описание: Data science revolves around two giants, which are big data analytics and deep learning. It is becoming challenging to handle and retrieve useful information due to how fast data is expanding. This book presents the technologies and tools to simplify and streamline the formation of big data along with deep learning systems.
Описание: Learning about distributed systems, becoming familiar with technologies such as containers and functions, and knowing how to put everything together can be daunting. With this practical guide, you`ll get up to speed on patterns for building cloud native applications and best practices for common tasks such as messaging, eventing, and DevOps.
Описание: Contrary to popular myth, we do not yet live in the "Information Age." At best, we live the "Data Age," obsessed with the production, collection, storage, dissemination, and monetization of digital data. But data, in and of itself, isnt valuable. Data only becomes valuable when we make sense of it. We rely on "information professionals" to help us understand data, but most fail in their efforts. Why? Not because they lack intelligence or tools, but mostly because they lack the necessary skills. Most information professionals have been trained primarily in the use of data analysis tools (Tableau, PowerBI, Qlik, SAS, Excel, R, etc.), but even the best tools are only useful in the hands of skilled individuals. Anyone can pick up a hammer and pound a nail, but only skilled carpenters can use a hammer to build a reliable structure. Making sense of data is skilled work, and developing those skills requires study and practice. Weaving data into understanding involves several distinct but complementary thinking skills. Foremost among them are critical thinking and scientific thinking. Until information professionals develop these capabilities, we will remain in the dark ages of data. This book is for information professionals, especially those who have been thrust into this important work without having a chance to develop these foundational skills. If youre an information professional and have never been trained to think critically and scientifically with data, this book will get you started. Once on this path, youll be able to help usher in an Information Age worthy of the name.
Автор: Kavita Taneja, Harmunish Taneja Название: Data science and innovations for intelligent systems : ISBN: 0367676273 ISBN-13(EAN): 9780367676278 Издательство: Taylor&Francis Рейтинг: Цена: 22202.00 р. Наличие на складе: Есть у поставщика Поставка под заказ.
Описание: This book not only focuses on the practical applications of data science to achieve computational excellence, but also digs deep into the issues and implications of intelligent systems.
Описание: THE SERIES: INTELLIGENT BIOMEDICAL DATA ANALYSIS By focusing on the methods and tools for intelligent data analysis, this series aims to narrow the increasing gap between data gathering and data comprehension. Emphasis is also given to the problems resulting from automated data collection in modern hospitals, such as analysis of computer-based patient records, data warehousing tools, intelligent alarming, effective and efficient monitoring. In medicine, overcoming this gap is crucial since medical decision making needs to be supported by arguments based on existing medical knowledge as well as information, regularities and trends extracted from big data sets.
Автор: Alex Lui, Anna Farzinder, Mingboo Gong Название: Transforming Healthcare with Big Data and AI ISBN: 1641138971 ISBN-13(EAN): 9781641138970 Издательство: Mare Nostrum (Eurospan) Рейтинг: Цена: 7069.00 р. Наличие на складе: Есть у поставщика Поставка под заказ.
Описание: Healthcare and technology are at a convergence point where significant changes are poised to take place. The vast and complex requirements of medical record keeping, coupled with stringent patient privacy laws, create an incredibly unwieldy maze of health data needs. While the past decade has seen giant leaps in AI, machine learning, wearable technologies, and data mining capacities that have enabled quantities of data to be accumulated, processed, and shared around the globe. Transforming Healthcare with Big Data and AI examines the crossroads of these two fields and looks to the future of leveraging advanced technologies and developing data ecosystems to the healthcare field.
This book is the product of the Transforming Healthcare with Data conference, held at the University of Southern California. Many speakers and digital healthcare industry leaders contributed multidisciplinary expertise to chapters in this work. Authors’ backgrounds range from data scientists, healthcare experts, university professors, and digital healthcare entrepreneurs. If you have an understanding of data technologies and are interested in the future of Big Data and A.I. in healthcare, this book will provide a wealth of insights into the new landscape of healthcare.
Автор: Connie Clare, Maria Cruz Название: Engaging Researchers with Data Management ISBN: 1783747986 ISBN-13(EAN): 9781783747986 Издательство: Неизвестно Рейтинг: Цена: 9003.00 р. Наличие на складе: Есть у поставщика Поставка под заказ.
Описание: This handbook is a collaboration by research institutions, for research institutions. It aims not only to inspire and engage, but also to help drive cultural change towards better data management. It has been written for anyone interested in RDM, or simply, good research practice.
ООО "Логосфера " Тел:+7(495) 980-12-10 www.logobook.ru