Normal view MARC view ISBD view

Justin Grimmer, Margaret E. Roberts, and Brandon M. Stewart. Text as Data: A new framework for machine learning and the social sciences (Record no. 528431)

000 -LEADER
fixed length control field	04383nam a22001457a 4500
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field	241205b \|\|\|\|\| \|\|\|\| 00\| 0 eng d
100 ## - MAIN ENTRY--PERSONAL NAME
Personal name	Valadao, Rodrigo
245 ## - TITLE STATEMENT
Title	Justin Grimmer, Margaret E. Roberts, and Brandon M. Stewart. Text as Data: A new framework for machine learning and the social sciences
260 ## - PUBLICATION, DISTRIBUTION, ETC. (IMPRINT)
Place of publication, distribution, etc	Administrative Science Quarterly
300 ## - PHYSICAL DESCRIPTION
Extent	69(3),Sep, 2024: p.NP49-NP52
520 ## - SUMMARY, ETC.
Summary, etc	Text as Data is a remarkable milestone in demonstrating how analysis of large text-based corpora can enable original ways of theorizing in the social sciences. While the authors provide a thorough overview of algorithms and models, these features are not the book’s main attractions. The real value of this book stems from its insights into research principles for scholars’ use of textual data, enriched by numerous illustrations based on published studies. I believe the book will inspire organizational theorists to develop novel types of research questions and to deploy inventive approaches to examining social phenomena. The book offers two other noteworthy contributions. First, “a central argument of this book is that the goal of text as data research differs from the goals of computer science work” (p. 5). In articulating this difference, the authors “reject the idea that models of text should be optimized to recover one true underlying, inherent organization in the texts—because, [they] argue, no one such organization exists” (p. 9). This statement embodies what the authors call the agnostic approach to textual analysis. Second, the authors repeatedly emphasize the importance of validation when working with textual data, because “there are few (if any) theorems that justify text analysis methods as applied to natural language” (p. 30). With that, they underscore that validation which maintains “humans in the loop” (p. 31) is fundamental for good research in the social sciences. This agnostic approach and the call for extensive validation and human interpretation draw an important line between social scientists and computing scientists. The distinction is crucial because it reminds organizational theorists that our primary focus should be theorizing and that computational tools, regardless of how shiny they might be, are just that: tools. Text as Data is a remarkable milestone in demonstrating how analysis of large text-based corpora can enable original ways of theorizing in the social sciences. While the authors provide a thorough overview of algorithms and models, these features are not the book’s main attractions. The real value of this book stems from its insights into research principles for scholars’ use of textual data, enriched by numerous illustrations based on published studies. I believe the book will inspire organizational theorists to develop novel types of research questions and to deploy inventive approaches to examining social phenomena. The book offers two other noteworthy contributions. First, “a central argument of this book is that the goal of text as data research differs from the goals of computer science work” (p. 5). In articulating this difference, the authors “reject the idea that models of text should be optimized to recover one true underlying, inherent organization in the texts—because, [they] argue, no one such organization exists” (p. 9). This statement embodies what the authors call the agnostic approach to textual analysis. Second, the authors repeatedly emphasize the importance of validation when working with textual data, because “there are few (if any) theorems that justify text analysis methods as applied to natural language” (p. 30). With that, they underscore that validation which maintains “humans in the loop” (p. 31) is fundamental for good research in the social sciences. This agnostic approach and the call for extensive validation and human interpretation draw an important line between social scientists and computing scientists. The distinction is crucial because it reminds organizational theorists that our primary focus should be theorizing and that computational tools, regardless of how shiny they might be, are just that: tools.- Reproduced https://journals.sagepub.com/doi/full/10.1177/00018392241239340
773 ## - HOST ITEM ENTRY
Main entry heading	Administrative Science Quarterly
906 ## - LOCAL DATA ELEMENT F, LDF (RLIN)
Subject DIP	BOOK REVIEW
942 ## - ADDED ENTRY ELEMENTS (KOHA)
Item type	Articles

Holdings
Withdrawn status	Lost status	Source of classification or shelving scheme	Damaged status	Not for loan	Permanent location	Current location	Date acquired	Serial Enumeration / chronology	Barcode	Date last seen	Koha item type
					Indian Institute of Public Administration	Indian Institute of Public Administration	2024-12-05	69(3),Sep, 2024: p.NP49-NP52	AR133823	2024-12-05	Articles

Powered by Koha