000 -LÍDER |
Campo de controle fixo |
nam a22 7a 4500 |
003 - CÓDIGO MARC DA AGÊNCIA CATALOGADORA |
Campo de controle |
BR-BrENAP |
005 - DATA E HORA DA ÚLTIMA ATUALIZAÇÃO |
Campo de controle |
20230809180814.0 |
008 - CAMPO DE TAMANHO FIXO |
Campo fixo de controle |
230314t20222022njua b 001 0 eng d |
020 ## - ISBN - INTERNATIONAL STANDARD BOOK NUMBER |
ISBN |
9780691207551 |
040 ## - FONTE DA CATALOGAÇÃO |
Agência catalogadora |
BR-BrENAP |
Idioma da catalogação |
Pt_BR |
041 ## - IDIOMA |
Idioma do texto |
eng |
090 ## - NÚMERO DE CLASSIFICAÇÃO |
Número de Classificação |
006.312 |
Cutter |
G8648t |
100 1# - ENTRADA PRINCIPAL - NOME PESSOAL |
Nome pessoal |
Grimmer, Justin |
9 (RLIN) |
68335 |
245 10 - TÍTULO PRINCIPAL |
Título principal |
Text as data : |
Subtítulo |
a new framework for machine learning and the social sciences / |
Indicação de responsabilidade |
por Justin Grimmer, Margaret E. Roberts e Brandon M. Stewart. -- |
260 ## - IMPRENTA (PUBLICAÇÃO, DISTRIBUIÇÃO, ETC.) |
Lugar de publicação, distribuição, etc. |
Nova Jersey, EUA ; |
-- |
Oxford, UK : |
Nome do editor, distribuidor, etc. |
Princeton University Press, |
Data de publicação, distribuição, etc |
2022. |
300 ## - DESCRIÇÃO FÍSICA |
Extensão |
336 p. : |
Detalhes físicos adicionais |
il. |
505 ## - NOTA DE CONTEÚDO |
Título |
Preface |
-- |
PART I - PRELIMINARIES |
-- |
CHAPTER 1 - Introduction |
-- |
1.1 How This Book Informs the Social Sciences |
-- |
1.2 How This Book Informs the Digital Humanities |
-- |
1.3 How This Book Informs Data Science in Industry and Government |
-- |
1.4 A Guide to This Book |
-- |
1.5 Conclusion |
-- |
CHAPTER 2 - Social Science Research and Text Analysis |
-- |
2.1 Discovery |
-- |
2.2 Measurement |
-- |
2.3 Inference |
-- |
2.4 Social Science as an Interative and Cumulative Process |
-- |
2.5 An Agnostic Approach to Text Analysis |
-- |
2.6 Discovery, Meansurement, and Causal Inference: How the Chinese Government Censors Social Media |
-- |
2.7 Six Principals of Text Analysis |
-- |
2.8 Conclusion: nText Data and Social Science |
-- |
PART II - SELECTION AND REPRESENTATION |
-- |
CHAPTER 3 - Principles of Selection and Representation |
-- |
3.1 Principle 1: Question-Specific Corpus Construction |
-- |
3.2 Principle 2: No Values-Free Corpus Construction |
-- |
3.3 Principle 3: No Right Way to Represent Text |
-- |
3.4 Principle 4: Validation |
-- |
3.5 State of the Union Addresses |
-- |
3.6 The Autorship of the Federalist Papers |
-- |
3.7 Conclusion |
-- |
CHAPTER 4 - Selecting Documents |
-- |
4.1 Populations and Quantities of Interest |
-- |
4.2 Four Types of Bias |
-- |
4.3 Considerations of "Found Data" |
-- |
4.4 Conclusion |
-- |
CHAPTER 5 - Bag of Words |
-- |
5.1 The Bag of Words Model |
-- |
5.2 Choose the Unit of Analysis |
-- |
5.3 Tokenize |
-- |
5.4 Reduce Complexity |
-- |
5.5 Construct Document-Feature Matrix |
-- |
5.6 Rethinking the Defaults |
-- |
5.7 Conclusion |
-- |
CHAPTER 6 - The Multinomial Language Model |
-- |
6.1 Multinomial Distribution |
-- |
6.2 Basic Language Modeling |
-- |
6.3 Regularization and Smoothing |
-- |
6.4 The Dirichlet Distribution |
-- |
6.5 Conclusion |
-- |
CHAPTER 7 - The Vector Space Model and Similarity Metrics |
-- |
7.1 Similarity Metrics |
-- |
7.2 Distance Metrics |
-- |
7.3 tf-idf Weighting |
-- |
7.4 Conclusion |
-- |
CHAPTER 8 - Distributed Representations of Words |
-- |
8.1 Why Word Embeddings |
-- |
8.2 Estimating Word Embeddings |
-- |
8.3 Aggregating Word Embeddings to the Document Level |
-- |
8.4 Validation |
-- |
8.5 Contextualized Word Embeddings |
-- |
8.6 Conclusion |
-- |
CHAPTER 9 - Rpresentations from Language Sequences |
-- |
9.1 Text Reuse |
-- |
9.2 Parts of Speech Tegging |
-- |
9.3 Named-Entity Recognition |
-- |
9.4 Dependency Parsing |
-- |
9.5 Broader Information Extraction Tasks |
-- |
9.6 Conclusion |
-- |
PART III - DISCOVERY |
-- |
CHAPTER 10 - Principles of Discovery |
-- |
10.1 Principle 1: Context Relevance |
-- |
10.2 Principle 2: No Ground Truth |
-- |
10.3 Principle 3: Judge the Concept, Not the Method |
-- |
10.4 Principle 4: Separate Data Is Best |
-- |
10.5 Conceptualizing the US Congress |
-- |
10.6 Conclusion |
-- |
CHAPTER 11 - Discriminating Words |
-- |
11.1 Mutual Information |
-- |
11.2 Fightin' Words |
-- |
11.3 Fictitious Prediction Problems |
-- |
11.4 Conclusion |
-- |
CHAPTER 12 - Clustering |
-- |
12.1 An Initial Example Using k-Means Clustering |
-- |
12.2 Representations to Clustering |
-- |
12.3 Approaches to Clustering |
-- |
12.4 Making Choices |
-- |
12.5 The Human Side of Clustering |
-- |
12.6 Conclusion |
-- |
CHAPTER 13 - Topic Models |
-- |
13.1 Latent Dirichlet Allocation |
-- |
13.2 Interpreting the Output of Topic Models |
-- |
13.3 Incorporating Structure into LDA |
-- |
13.4 Structural Topic Models |
-- |
13.5 Labeling Topic Models |
-- |
13.6 Conclusion |
-- |
CHAPTER 14 - Low-Dimensional Document Embeddings |
-- |
14.1 Principal Component Analysis |
-- |
14.2 Classical Multidimensional Scaling |
-- |
14.3 Conclusion |
-- |
PART IV - MEASUREMENT |
-- |
CHAPTER 15 - Principles of Measurement |
-- |
15.1 From Concept to Measurement |
-- |
15.2 What Makes a Good Measurement |
-- |
15.3 Balancing Discovery and Measurement with Sample Splits |
-- |
CHAPTER 16 - Word Counting |
-- |
16.1 Keyword Counting |
-- |
16.2 Dictionary Methods |
-- |
16.3 Limitations and Validations of Dictionary Methods |
-- |
16.4 Conclusion |
-- |
CHAPTER 17 - An Overview of Supervised Classification |
-- |
17.1 Example: Discursive Governance |
-- |
17.2 Create a Training Set |
-- |
17.3 Classify Documents with Supervised Learning |
-- |
17.4 Check Performance |
-- |
17.5 Using the Measure |
-- |
17.6 Conclusion |
-- |
CHAPTER 18 - Coding a Training Set |
-- |
18.1 Characteristics of a Good Training Set |
-- |
18.2 Hand Coding |
-- |
18.3 Crowdsourcing |
-- |
18.4 Supervision with Found Data |
-- |
18.5 Conclusion |
-- |
CHAPTEER 19 - Classifying Documents with Supervised |
-- |
19.1 Naive Bayes |
-- |
19.2 Machine Learning |
-- |
19.3 Example: Estimating Jihad Scores |
-- |
19.4 Conclusion |
-- |
CHAPTER 20 - Checking Performance |
-- |
20.1 Validation with Gold-Standard Data |
-- |
20.2 Validation without Gold-Standar Data |
-- |
20.3 Example: Validating Jihad Scores |
-- |
20.4 Conclusion |
-- |
CHAPTER 21 - Repurposing Discovery Methods |
-- |
21.1 Unsupervised Methods Tend to Measure Subject |
-- |
21.2 Example: Scaling via Differential Word Rates |
-- |
21.3 A Workflow for Repurposing Unsupervised Methods |
-- |
21.4 Concerns in Repurposing Unsupervised Methods for Measurement |
-- |
21.5 Conclusion |
-- |
PART V - INFERENCE |
-- |
CHAPTER 22 - Principles of inference |
-- |
22.1 Prediction |
-- |
22.2 Causal inference |
-- |
22.3 Comparing Prediction and Causal Inference |
-- |
22.4 Partial and General Equilibrium in Prediction and Causal Inference |
-- |
22.5 Conclusion |
-- |
CHAPTER 23 - Prediction |
-- |
23.1 The Basic Task of Prediction |
-- |
23.2 Similarities and Fifferences between Prediction and Measurement |
-- |
23.3 Five Principles of Prediction |
-- |
23.4 Using Text as Data for Prediction: Examples |
-- |
23.5 Conclusion |
-- |
CHAPTER 24 - Causal Inference |
-- |
24.1 Introduction to Causal Inference |
-- |
24.2 Similarities and Differences between Prediction and Measurement, and Causal Inference |
-- |
24.3 Key Principles of Causal Inference with Text |
-- |
24.4 The Mapping Function |
-- |
24.5 Workflows for Making Causal Inferences with Text |
-- |
24.6 Conclusion |
-- |
CHAPTER 25 - Text as Outcome |
-- |
25.1 An Experiment on Immigration |
-- |
25.2 The Effect of Presidential Public Appeals |
-- |
25.3 Conclusion |
-- |
CHAPTER 26 - Text as Treatment |
-- |
26.1 An Experiment Using Trump's Tweets |
-- |
26.2 A Candidate Biography Experiment |
-- |
26.3 Conclusion |
-- |
CHAPTER 27 - Text as Confounder |
-- |
27.1 Regression Adjustments for Text Confounders |
-- |
27.2 Matching Adjustments for Text |
-- |
27.3 Conclusion |
-- |
PART VI - CONCLUSION |
-- |
28.1 How to Use Text as Data in the Social Sciences |
-- |
28.2 Applying Our Principles beyond Text Data |
-- |
28.3 Avoiding the Cycle of Creation and Destruction in Social Science Methodology |
-- |
Acknowledgments |
-- |
Bibliography |
-- |
Index |
650 #0 - ENTRADA DE ASSUNTO - ASSUNTO TÓPICO |
Cabeçalho tópico ou nome geográfico |
Mineração de Dados de Texto |
9 (RLIN) |
68336 |
650 #0 - ENTRADA DE ASSUNTO - ASSUNTO TÓPICO |
Cabeçalho tópico ou nome geográfico |
Ciências Sociais - Processamento de Dados |
9 (RLIN) |
68337 |
650 #0 - ENTRADA DE ASSUNTO - ASSUNTO TÓPICO |
Cabeçalho tópico ou nome geográfico |
Machine Learning |
9 (RLIN) |
68338 |
700 1# - ENTRADA SECUNDÁRIA - NOME PESSOAL |
Nome pessoal |
Roberts, Margaret E. |
9 (RLIN) |
68339 |
700 1# - ENTRADA SECUNDÁRIA - NOME PESSOAL |
Nome pessoal |
Stewart, Brandon M. |
9 (RLIN) |
68340 |
909 ## - IDENTIFICAÇÃO DO CATALOGADOR |
Ano e mês da catalogação (aaaamm) |
202308 |
Identificação do catalogador |
Raynara |
942 ## - TIPO ESPECÍFICO |
Tipo de material |
Livro Geral |