Brian Everitt Book

Language: en
Pages: 202

Making Sense of Large Social Media Corpora

Author(s): Antonio Moreno-Ortiz

Categories: Language Arts & Disciplines

Type: Book
-
Published: 2024-04-29
-
Publisher: Springer Nature

This open access book offers a comprehensive overview of available techniques and approaches to explore large social media corpora, using as an illustrative case study the Coronavirus Twitter corpus. First, the author describes in detail a number of methods, strategies, and tools that can be used to access, manage, and explore large Twitter/X corpora, including both user-friendly applications and more advanced methods that involve the use of data management skills and custom programming scripts. He goes on to show how these tools and methods are applied to explore one of the largest Twitter datasets on the COVID-19 pandemic publicly released, covering the two years when the pandemic had the strongest impact on society. Specifically, keyword extraction, topic modelling, sentiment analysis, and hashtag analysis methods are described, contrasted, and applied to extract information from the Coronavirus Twitter Corpus. The book will be of interest to students and researchers in fields that make use of big data to address societal and linguistic concerns, including corpus linguistics, sociology, psychology, and economics.

Language: en
Pages: 523

Generalized Latent Variable Modeling

Author(s): Anders Skrondal, Sophia Rabe-Hesketh

Categories: Mathematics

Type: Book
-
Published: 2004-05-11
-
Publisher: CRC Press

This book unifies and extends latent variable models, including multilevel or generalized linear mixed models, longitudinal or panel models, item response or factor models, latent class or finite mixture models, and structural equation models. Following a gentle introduction to latent variable modeling, the authors clearly explain and contrast a wi

Language: en
Pages: 583

Regression Modeling Strategies

Author(s): Frank E. Harrell

Categories: Mathematics

Type: Book
-
Published: 2013-03-09
-
Publisher: Springer Science & Business Media

Many texts are excellent sources of knowledge about individual statistical tools, but the art of data analysis is about choosing and using multiple tools. Instead of presenting isolated techniques, this text emphasizes problem solving strategies that address the many issues arising when developing multivariable models using real data and not standard textbook examples. It includes imputation methods for dealing with missing data effectively, methods for dealing with nonlinear relationships and for making the estimation of transformations a formal part of the modeling process, methods for dealing with "too many variables to analyze and not enough observations," and powerful model validation techniques based on the bootstrap. This text realistically deals with model uncertainty and its effects on inference to achieve "safe data mining".

Language: en
Pages: 516

Elementary Cluster Analysis

Author(s): James C. Bezdek

Categories: Science

Type: Book
-
Published: 2022-10-17
-
Publisher: CRC Press

The availability of packaged clustering programs means that anyone with data can easily do cluster analysis on it. But many users of this technology don't fully appreciate its many hidden dangers. In today's world of "grab and go algorithms," part of my motivation for writing this book is to provide users with a set of cautionary tales about cluster analysis, for it is very much an art as well as a science, and it is easy to stumble if you don't understand its pitfalls. Indeed, it is easy to trip over them even if you do! The parenthetical word usually in the title is very important, because all clustering algorithms can and do fail from time to time. Modern cluster analysis has become so te...

Language: en
Pages: 206

A Whistle-Stop Tour of Statistics

Author(s): Brian Everitt

Categories: Mathematics

Type: Book
-
Published: 2011-12-01
-
Publisher: CRC Press

The book is intended as a quick source of reference and as an aide-memoir for students taking A-level, undergraduate or postgraduate statistics courses. It includes numerous examples, helping instructors on such courses by providing their students with small data sets with which to work.