Collecting tweets related to the stock market
Objectives
Gain experience of collecting data from Twitter using Twitter API
Gain experience of data storage to store the data and query
Gain experience of collecting real-time data
Gain experience of data cleaning
Important Notes
Work in groups of 4 students
All reports must be submitted as a PDF, with source code as an archive (e.g. zip, gz)
Save the submission as “Assignment-1_Group-#.zip”
Assignment Details
This assignment consists of four parts:
Collecting data: In this assignment, you need to collect data related to the stock market from Twitter for one week. In Twitter, ticker symbols like #gold are used for stocks and companies. You are requested to collect the tweets with some specific keywords and store them in different files. The following keywords should be used:
a. Altcoin
Bitcoin
Coindesk
Cryptocurrency
Gold
APPL
GOOG
YHOO
Each tweet is a json file with the following format:
{"created_at":”.......... ”,
"id":”…........ ”,
"text":" Time to buy some ether!\n#ethereum #investing #cryptocurrency” “user_id”:”…........ ”
…
}
Saving data: You need to save the requested data into csv format of 8 files where data related to each keyword is saved. Each file consist of four columns: tweet id, time of tweet, user id and text
Cleaning data: remove duplication, remove punctuations, remove numbers in tweets, and remove words with length less than 2
Visualizing data: You need to present the daily number of tweets for each keyword as well as the daily number of users
DescriptionIn this final assignment, the students will demonstrate their ability to apply two ma
Path finding involves finding a path from A to B. Typically we want the path to have certain properties,such as being the shortest or to avoid going t
Develop a program to emulate a purchase transaction at a retail store. Thisprogram will have two classes, a LineItem class and a Transaction class. Th
1 Project 1 Introduction - the SeaPort Project series For this set of projects for the course, we wish to simulate some of the aspects of a number of
1 Project 2 Introduction - the SeaPort Project series For this set of projects for the course, we wish to simulate some of the aspects of a number of