The Web as a Data Platform for Everyone
Javier Espinosa – THATCamp UGA in Valence 2018 (http://ugainvalence2018.thatcamp.org)
Posted Tue, 12 Jun 2018: http://ugainvalence2018.thatcamp.org/2018/06/12/the-web-as-a-data-platform-for-everyone/

Type of session: Talk

Approximate duration: 1.5 hr

Skill level: All

We are living in an exciting moment: a deluge of documents, texts, images, videos, tweets, etc. is accessible via the Web and can be used for conducting social studies. In this sense, the Web is a data platform that non-specialists can fully exploit if they have the right tools. Yet the majority of tools are limited to specific domains or very specific tasks.

The objective of this talk is to introduce the basic concepts that make up the Web (HTML pages, the HTTP protocol, URIs/URLs, client-server architectures, etc.) and how they work together. This knowledge can help non-experts understand how to use more basic tools for retrieving data from the Web and how to communicate with computer specialists. As an example, I will describe how to build a corpus using articles from The Guardian web service.
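To give a rough idea of how these pieces fit together, here is a minimal sketch in Python (a language chosen for illustration; the talk does not prescribe one). It builds the URL a client would send in an HTTP request, splits that URL into its parts, and turns the server's JSON reply into corpus entries. The endpoint, the parameter names (`q`, `page-size`, `show-fields`, `api-key`) and the reply fields (`webTitle`, `webUrl`, `bodyText`) are assumptions based on The Guardian's Open Platform content API and should be checked against its documentation; the function names are hypothetical.

```python
import json
from urllib.parse import urlencode, urlsplit, parse_qs
from urllib.request import urlopen

# Assumed endpoint of The Guardian's Open Platform content API.
GUARDIAN_SEARCH = "https://content.guardianapis.com/search"


def build_search_url(query, api_key, page_size=10):
    """Build the URL that the client sends to the server in an HTTP GET request."""
    params = {
        "q": query,                 # search terms
        "page-size": page_size,     # number of articles per reply
        "show-fields": "bodyText",  # assumed field name for the plain article text
        "api-key": api_key,         # personal key issued by the service
    }
    return GUARDIAN_SEARCH + "?" + urlencode(params)


def describe_url(url):
    """Split a URL into its parts: scheme (protocol), host (server), path, query."""
    parts = urlsplit(url)
    return {
        "scheme": parts.scheme,
        "host": parts.netloc,
        "path": parts.path,
        "query": parse_qs(parts.query),
    }


def parse_articles(raw_json):
    """Turn the server's JSON reply into a list of corpus entries."""
    results = json.loads(raw_json)["response"]["results"]
    return [
        {
            "title": item["webTitle"],
            "url": item["webUrl"],
            # Articles without a text field get an empty string.
            "text": item.get("fields", {}).get("bodyText", ""),
        }
        for item in results
    ]


def fetch_corpus(query, api_key):
    """Send the request over HTTP and return the parsed articles (needs network access)."""
    with urlopen(build_search_url(query, api_key)) as reply:
        return parse_articles(reply.read().decode("utf-8"))
```

Calling `fetch_corpus("digital humanities", "YOUR-KEY")` with a valid key would play the role of the client: it sends the URL to the server and receives the articles back. `describe_url` can be used on any address to see which part names the protocol, which the server, and which the resource being requested.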
