Multidimensional Mining of Massive Text Data, Zhang, Chao Han, Jiawei
Автор: Leskovec Jure Название: Mining of Massive Datasets ISBN: 1108476341 ISBN-13(EAN): 9781108476348 Издательство: Cambridge Academ Рейтинг: Цена: 10771.00 р. Наличие на складе: Есть у поставщика Поставка под заказ.
Описание: Essential reading for students and practitioners, this book focuses on practical algorithms used to solve key problems in data mining, with exercises suitable for students from the advanced undergraduate level and beyond. This third edition includes new and extended coverage on decision trees, deep learning, and mining social-network graphs.
Автор: Chao Zhang, Jiawei Han Название: Multidimensional Mining of Massive Text Data ISBN: 1681735210 ISBN-13(EAN): 9781681735214 Издательство: Mare Nostrum (Eurospan) Рейтинг: Цена: 14692.00 р. Наличие на складе: Нет в наличии.
Описание: Unstructured text, as one of the most important data forms, plays a crucial role in data-driven decision making in domains ranging from social networking and information retrieval to scientific research and healthcare informatics. In many emerging applications, people's information need from text data is becoming multidimensional—they demand useful insights along multiple aspects from a text corpus. However, acquiring such multidimensional knowledge from massive text data remains a challenging task. This book presents data mining techniques that turn unstructured text data into multidimensional knowledge. We investigate two core questions. (1) How does one identify task-relevant text data with declarative queries in multiple dimensions? (2) How does one distill knowledge from text data in a multidimensional space? To address the above questions, we develop a text cube framework. First, we develop a cube construction module that organizes unstructured data into a cube structure, by discovering latent multidimensional and multi-granular structure from the unstructured text corpus and allocating documents into the structure. Second, we develop a cube exploitation module that models multiple dimensions in the cube space, thereby distilling from user-selected data multidimensional knowledge. Together, these two modules constitute an integrated pipeline: leveraging the cube structure, users can perform multidimensional, multigranular data selection with declarative queries; and with cube exploitation algorithms, users can extract multidimensional patterns from the selected data for decision making. The proposed framework has two distinctive advantages when turning text data into multidimensional knowledge: flexibility and label-efficiency. First, it enables acquiring multidimensional knowledge flexibly, as the cube structure allows users to easily identify task-relevant data along multiple dimensions at varied granularities and further distill multidimensional knowledge. Second, the algorithms for cube construction and exploitation require little supervision; this makes the framework appealing for many applications where labeled data are expensive to obtain.
Автор: O. Maimon; M. Last Название: Knowledge Discovery and Data Mining ISBN: 1441948422 ISBN-13(EAN): 9781441948427 Издательство: Springer Рейтинг: Цена: 19559.00 р. Наличие на складе: Есть у поставщика Поставка под заказ.
Описание: This book presents a specific and unified approach to Knowledge Discovery and Data Mining, termed IFN for Information Fuzzy Network methodology. Data Mining (DM) is the science of modelling and generalizing common patterns from large sets of multi-type data. DM is a part of KDD, which is the overall process for Knowledge Discovery in Databases. The accessibility and abundance of information today makes this a topic of particular importance and need. The book has three main parts complemented by appendices as well as software and project data that are accessible from the book's web site (http: //www.eng.tau.ac.iV-maimonlifn-kdg ). Part I (Chapters 1-4) starts with the topic of KDD and DM in general and makes reference to other works in the field, especially those related to the information theoretic approach. The remainder of the book presents our work, starting with the IFN theory and algorithms. Part II (Chapters 5-6) discusses the methodology of application and includes case studies. Then in Part III (Chapters 7-9) a comparative study is presented, concluding with some advanced methods and open problems. The IFN, being a generic methodology, applies to a variety of fields, such as manufacturing, finance, health care, medicine, insurance, and human resources. The appendices expand on the relevant theoretical background and present descriptions of sample projects (including detailed results).
Автор: David A. Bader Название: Massive graph analytics / ISBN: 0367464128 ISBN-13(EAN): 9780367464127 Издательство: Taylor&Francis Рейтинг: Цена: 20671.00 р. Наличие на складе: Есть у поставщика Поставка под заказ.
Описание: Massive Graph Analytics provides a comprehensive introduction to massive graph analytics, featuring contributions from thought leaders across academia, industry, and government. The book will be beneficial to students, researchers and practitioners, in academia, national laboratories, and industry in massive scale graph analytics.
Автор: Jacob Kogan; Charles Nicholas; Marc Teboulle Название: Grouping Multidimensional Data ISBN: 3642066542 ISBN-13(EAN): 9783642066542 Издательство: Springer Рейтинг: Цена: 18167.00 р. Наличие на складе: Есть у поставщика Поставка под заказ.
Описание: Clustering can be used as an independent data mining task to discern intrinsic characteristics of data, or as a preprocessing step with the clustering results then used for classification, correlation analysis, or anomaly detection.Kogan and his co-editors have put together recent advances in clustering large and high-dimension data.
Автор: Charu C. Aggarwal Название: Machine Learning for Text ISBN: 3030088073 ISBN-13(EAN): 9783030088071 Издательство: Springer Рейтинг: Цена: 6986.00 р. Наличие на складе: Есть у поставщика Поставка под заказ.
Описание: Text analytics is a field that lies on the interface of information retrieval,machine learning, and natural language processing, and this textbook carefully covers a coherently organized framework drawn from these intersecting topics. The chapters of this textbook is organized into three categories:- Basic algorithms: Chapters 1 through 7 discuss the classical algorithms for machine learning from text such as preprocessing, similarity computation, topic modeling, matrix factorization, clustering, classification, regression, and ensemble analysis.- Domain-sensitive mining: Chapters 8 and 9 discuss the learning methods from text when combined with different domains such as multimedia and the Web. The problem of information retrieval and Web search is also discussed in the context of its relationship with ranking and machine learning methods. - Sequence-centric mining: Chapters 10 through 14 discuss various sequence-centric and natural language applications, such as feature engineering, neural language models, deep learning, text summarization, information extraction, opinion mining, text segmentation, and event detection. This textbook covers machine learning topics for text in detail. Since the coverage is extensive,multiple courses can be offered from the same book, depending on course level. Even though the presentation is text-centric, Chapters 3 to 7 cover machine learning algorithms that are often used indomains beyond text data. Therefore, the book can be used to offer courses not just in text analytics but also from the broader perspective of machine learning (with text as a backdrop). This textbook targets graduate students in computer science, as well as researchers, professors, and industrial practitioners working in these related fields. This textbook is accompanied with a solution manual for classroom teaching.
Описание: This work considers Kronecker-based models with finite as well as countably infinite state spaces for multidimensional Markovian systems by paying particular attention to those whose reachable state spaces are smaller than their product state spaces. Numerical methods for steady-state and transient analysis of Kronecker-based multidimensional Markovian models are discussed in detail together with implementation issues. Case studies are provided to explain concepts and motivate use of methods. Having grown out of research from the past twenty years, this book expands upon the author’s previously published book Analyzing Markov Chains using Kronecker Products (Springer, 2012). The subject matter is interdisciplinary and at the intersection of applied mathematics and computer science. The book will be of use to researchers and graduate students with an understanding of basic linear algebra, probability, and discrete mathematics.
Автор: Achille A. Lemmi; Gianni Betti Название: Fuzzy Set Approach to Multidimensional Poverty Measurement ISBN: 1441941711 ISBN-13(EAN): 9781441941718 Издательство: Springer Рейтинг: Цена: 23757.00 р. Наличие на складе: Есть у поставщика Поставка под заказ.
Описание: This volume brings together advanced thinking on the multidimensional measurement of poverty.
Автор: Paul Fieguth Название: Statistical Image Processing and Multidimensional Modeling ISBN: 1461427053 ISBN-13(EAN): 9781461427056 Издательство: Springer Рейтинг: Цена: 18167.00 р. Наличие на складе: Есть у поставщика Поставка под заказ.
Описание: Examples abound throughout remote sensing (satellite data mapping, data assimilation, climate-change studies, land use), medical imaging (organ segmentation, anomaly detection), computer vision (image classification, segmentation), and other 2D/3D problems (biological imaging, porous media).
Автор: Ingwer Borg, Patrick J.F. Groenen, Patrick Mair Название: Applied multidimensional scaling and unfolding. ISBN: 3319734709 ISBN-13(EAN): 9783319734705 Издательство: Springer Рейтинг: Цена: 9083.00 р. Наличие на складе: Есть у поставщика Поставка под заказ.
Описание: This book introduces multidimensional scaling (MDS) and unfolding as data analysis techniques for applied researchers. Unfolding is a related method that maps preference data (typically evaluative ratings of different persons on a set of objects) as distances between two sets of points (representing the persons and the objects, resp.).
Описание: This work considers Kronecker-based models with finite as well as countably infinite state spaces for multidimensional Markovian systems by paying particular attention to those whose reachable state spaces are smaller than their product state spaces. Numerical methods for steady-state and transient analysis of Kronecker-based multidimensional Markovian models are discussed in detail together with implementation issues. Case studies are provided to explain concepts and motivate use of methods. Having grown out of research from the past twenty years, this book expands upon the author’s previously published book Analyzing Markov Chains using Kronecker Products (Springer, 2012). The subject matter is interdisciplinary and at the intersection of applied mathematics and computer science. The book will be of use to researchers and graduate students with an understanding of basic linear algebra, probability, and discrete mathematics.
Автор: Nishisato, Shizuhiko Название: Multidimensional Nonlinear Descriptive Analysis ISBN: 0367390647 ISBN-13(EAN): 9780367390648 Издательство: Taylor&Francis Рейтинг: Цена: 9798.00 р. Наличие на складе: Нет в наличии.
Описание:
Quantification of categorical, or non-numerical, data is a problem that scientists face across a wide range of disciplines. Exploring data analysis in various areas of research, such as the social sciences and biology, Multidimensional Nonlinear Descriptive Analysis presents methods for analyzing categorical data that are not necessarily sampled randomly from a normal population and often involve nonlinear relations.
This reference not only provides an overview of multidimensional nonlinear descriptive analysis (MUNDA) of discrete data, it also offers new results in a variety of fields. The first part of the book covers conceptual and technical preliminaries needed to understand the data analysis in subsequent chapters. The next two parts contain applications of MUNDA to diverse data types, with each chapter devoted to one type of categorical data, a brief historical comment, and basic skills peculiar to the data types. The final part examines several problems and then concludes with suggestions for future progress.
Covering both the early and later years of MUNDA research in the social sciences, psychology, ecology, biology, and statistics, this book provides a framework for potential developments in even more areas of study.
ООО "Логосфера " Тел:+7(495) 980-12-10 www.logobook.ru