Monday, March 31, 2014

Week 10 Big Data

This week's reading assignment is The Historian's Macroscrope, currently available in draft form on a Word Press platform allowing comments.

Thus our questions this week involve both what it's like to read a work of scholarship on an unfamiliar idea in draft form and the substance of the work.

Process questions:
  • What do they mean by “live writing”?
  • What strategy did you use for reading this project?
  • Would you write in public like this?
  • What is going on behind their scenes?


Substance questions:
  • What is their Big Idea?
  • Can we imagine together the kinds of questions historians will want to ask with Big Data?
  • What do they mean by the “macroscope” metaphor? Why not a telescope?
  • Would you feel comfortable working with an “open notebook”? With live writing?
  • Do you accept the claim that we (as historians) need to understand the search algorithm?
  • What is “big data”? What do you think of their definition?: “If it’s more data that you could conceivably read yourself in a reasonable amount of time, or that requires computational intervention to make new sense of it, it’s big enough!”
  • Who is the audience for this project?
  • What can you learn by working with big data that you can’t without it?
  • What do you need to learn to do to work with big data?
  • How is normalization different from tokenization?
  • What kinds of questions can you imagine yourself asking with big data?
  • Why do they argue that Big Data does not herald an epistemological transformation for historians?
  • What is topic modelling, and what would you use it for? What uses could you put Paper Machine to?
  • What is network analysis and what might you use it for?

 As a bonus, we might watch this video about a doctoral student’s project in “distant reading”: http://phdcomics.com/comics/archive.php?comicid=1628

8 comments:

  1. You have certainly explained that Big data analytics is the process of examining big data to uncover hidden patterns, unknown correlations and other useful information that can be used to make better decisions..The big data analytics is the major part to be understood regarding Hadoop Training Chennai program. Via your quality content i get to know about that in deep. Thanks for sharing this here.

    ReplyDelete
  2. This comment has been removed by a blog administrator.

    ReplyDelete
  3. Amazing content.If you are interested instudying nodejs visit this website. Nodejs is an open source, server side web application that enables you to build fast and scalable web application that is capable of running large number of simultaneous connections that has high throughput.
    Node js Training in Chennai | Node JS training institute in chennai

    ReplyDelete