PinterestGoogle+

Big data, computer science, experimental methods, and computational text analysis are part of an ever-growing range of methods embraced by political science. As we expand our methodological horizons, the nature, scope, and focus of the questions we investigate will change.

For example, the combination of computerised text analysis techniques and Big Data now allows us to model millions of texts in a matter of hours. This can help us to gauge political preferences, the evolution of political language, or popular sentiment.

In turn, the increased use of the experimental method has allowed political scientists to investigate a wide set of issues, including voter behaviour, public opinion, and decision-making processes.

This new series, co-hosted by the Oxford University Politics Blog and the Oxford Q-Step Centre is all about “methods”.  What advances have we seen in recent years? What can we learn today that we could not a decade ago? And, what is the future of methods in political science? Find out more in our Advances in Political Science Methods series.

About Q-Step

Oxford-QC-Primary-Full-Tree-LogoQ-Step is a £19.5 million programme designed to promote a step-change in quantitative social science training in the UK Funded by the Nuffield Foundation, ESRC and HEFCE, Q-Step was developed as a strategic response to the shortage of quantitatively-skilled social science graduates.

Q-Step is funding fifteen universities across the UK to establish Q-Step Centres that will support the development and delivery of specialist undergraduate programmes, including new courses, work placements and pathways to postgraduate study.

The resulting expertise and resources will be shared across the higher education sector through an accompanying support programme which will also forge links with schools and employers.

Oxford is one of 15 universities to be selected nationally to host Q-Step. The Oxford Q-Step Centre is hosted by the Department of Politics and International Relations, in close co-operation with the Department of Sociology, University of Oxford.  

 

When does traditional statistical modelling (TSM) become machine learning (ML)?[i] “Machine learning” has truly become a buzzword that is applied rather liberally to a wide range of modelling applications. But, the difference is far from a question of semantics: there are fundamental differences between ML and TSM that data practitioners should keep in mind. Similarities But, let’s start off with some commonalities between ML and TSM. In both disciplines our aim is to build a (statistical) model (to use TSM terminology) that minimises loss, that is, that achieves the smallest possible difference between observed values and the values estimated by the model. In so doing, we have to achieve a successful balance between model complexity and generalisability: pick too complex a …

A few months ago, I published an exceptionally short paper presenting experimental evidence on a particular issue of survey methodology. This experience has taught me valuable lessons about conveying the necessary information under extreme restrictions on the word count. In its original version, my paper was 2,300 words long and it was formatted in accordance with the convention of the field: an introduction highlighting the relevance of the research question and the gaps in the existing literature, an empirical section describing the methods and presenting the results, and a conclusion discussing implications, limitations, and offering directions for future research. Since the study relied on experimental methods and was already short, I decided to submit this contribution to the Journal of …

On 18th April 2017 Theresa May announced a snap general election to take place on 8th June. The announcement came as a surprise and was widely believed to be motivated by the large lead in the polls (approximately twenty points) that Ms May holds over her main rival, Labour Party leader Jeremy Corbyn. In calling the snap election at this point, Theresa May has put a lot of confidence in her projected lead in the polls. This is interesting because British election polls have previously been met with a large degree of skepticism and distrust. In this blogpost, I briefly explore the British polling experience and highlight the various explanations that have been provided for the UK’s poor track record …

Below, I discuss and analyse pre-processing decisions in relation to an often-used application of text analysis: scaling. Here, I’ll be using a new tool, called preText (for R statistical software), to investigate the potential effect of different pre-processing options on our estimates. Replication material for this post may be found on my GitHub page. Feature Selection and Scaling Scaling algorithms rely on the bag-of-words (BoW) assumption, i.e. the idea that we can reduce text to individual words and sample them independently from a “bag” and still get some meaningful insights from the relative distribution of words across a corpus. For the demonstration below, I’ll be using the same selection of campaign speeches from one of my earlier blog posts, in which I used a …

About thirty major pieces of government legislation are produced annually in the UK. As there are five main opportunities to amend each bill (two stages in the Commons and three in the Lords) and bills may undergo hundreds, even thousands, of amendments, comprehensive quantitative analysis of legislative changes is almost impossible by manual methods. We used insights from bioinformatics to develop a semi-automatic procedure to map the changes in successive versions the text of a bill as it passes through parliament. This novel tool for scholars of the parliamentary process could be used, for example, to compare amendment patterns over time, between different topics or governments, and between legislatures. Parliamentary amendments A major role of parliament is to scrutinize and amend …

The 2016 United States presidential election—or in John Oliver’s most recent definition: ‘lice-on-a-rat-on-a-horse-corpse-on-fire-2016’—has reached its final leg. As a political scientist and a computational text analyst, I cannot resist sharing my two cents on an election that has certainly broken a model or two. Following in the footsteps of two colleagues who recently produced two excellent articles (you can read them here and here), in this post I’d like to analyse a few examples of the exceptional language used in this elections cycle. Text analysis can help us understand two commonly held beliefs or facts (the distinction has become a bit blurred over the course of this year’s election cycle) about the US elections: Donald Trump is running a negative …