I will try to attach the data file below, but if it is missing, email me and I can send it through WeTransfer because the file is large.
You are to write a program or script that parses email addresses and phone numbers out of the data files.
The data files may be disk images or memory images. They will be in raw/dd format.
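Because a raw/dd image can be too large to load at once, one possible approach is to read it in fixed-size chunks that overlap slightly, so that an address or number straddling a chunk boundary is not lost. The following is only a minimal sketch: the function name iter_chunks, the 1 MiB chunk size, and the 256-byte overlap are all assumptions for illustration and are not part of the assignment.

```python
def iter_chunks(path, chunk_size=1 << 20, overlap=256):
    """Yield (offset, data) pairs from a raw image.

    Each chunk after the first is prefixed with the last `overlap` bytes
    of the previous one, so a string split across a boundary still
    appears whole in one chunk. `offset` is the position of data[0]
    within the file.
    """
    with open(path, "rb") as fh:
        position = 0      # file offset of the next block to be read
        tail = b""        # bytes carried over from the previous chunk
        while True:
            block = fh.read(chunk_size)
            if not block:
                break
            data = tail + block
            yield position - len(tail), data
            position += len(block)
            # Guard against overlap == 0, where data[-0:] would be everything.
            tail = data[-overlap:] if overlap else b""
```

Note that anything lying entirely inside the overlap region is seen twice, so hits should be de-duplicated by file offset if exact counts matter.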
Bonus: Command-line options for extracting phone numbers and email addresses
ex: ./dataExtractor -phonenumbers -emails
The above would extract both, whereas
ex: ./dataExtractor -emails
The above would only extract email addresses
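One way to handle the two flags shown above is Python's argparse module, which accepts single-dash long options such as -emails. This is a hedged sketch only: the positional image argument and the choice to extract both kinds of data when no flag is given are assumptions, since the examples above do not show how the input file is passed.

```python
import argparse


def parse_args(argv=None):
    """Parse the -emails / -phonenumbers flags from the bonus task."""
    parser = argparse.ArgumentParser(
        prog="dataExtractor",
        description="Carve email addresses and phone numbers from a raw image.",
    )
    # Assumption: the image path is given as a positional argument.
    parser.add_argument("image", help="path to the raw/dd image file")
    parser.add_argument("-emails", action="store_true",
                        help="extract email addresses")
    parser.add_argument("-phonenumbers", action="store_true",
                        help="extract phone numbers")
    args = parser.parse_args(argv)
    # Assumption: with neither flag given, extract both.
    if not (args.emails or args.phonenumbers):
        args.emails = args.phonenumbers = True
    return args
```

With this sketch, parse_args(["-emails", "image.dd"]) would enable only email extraction, while passing both flags enables both.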
Turn in your source code and the output from running it on the sample file.
The image was acquired using FTK Imager on a 256MB USB drive. Some data is on the drive in allocated space and some is in unallocated space.
I am currently running it through Autopsy and will post the list of emails it finds so that you have some results to compare to.
************************************************************************************************************************
My script creates two output files:
emails.txt, which contains the emails it carved, and phones.txt, which contains the phone numbers it carved.
These are just what my script found; your results may vary. It is best to look at the phone numbers and emails to see if they look valid. I matched phone numbers in the following formats:
(xxx) xxx.xxxx
(xxx).xxx.xxxx
(xxx) xxx-xxxx
(xxx)-xxx-xxxx
xxx.xxx.xxxx
xxx-xxx-xxxx
xxx xxx-xxxx
xxx xxx.xxxx
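The eight formats listed above can be expressed as a single regular expression over bytes. This is a sketch rather than the script that produced the posted results: the names PHONE_RE and find_phones are made up for illustration, and the digit-boundary checks at both ends are an added assumption meant to avoid matching inside longer runs of digits.

```python
import re

# Area code with or without parentheses, then either
#   a space, three digits, and '.' or '-'      e.g. "(xxx) xxx.xxxx"
# or the same '.' or '-' separator used twice  e.g. "xxx-xxx-xxxx"
PHONE_RE = re.compile(
    rb"(?<!\d)"                        # assumption: not preceded by a digit
    rb"(?:\(\d{3}\)|\d{3})"            # (xxx) or xxx
    rb"(?: \d{3}[.-]|([.-])\d{3}\1)"   # middle three digits plus separators
    rb"\d{4}"                          # last four digits
    rb"(?!\d)"                         # assumption: not followed by a digit
)


def find_phones(data):
    """Return every phone-number match in a bytes buffer as text."""
    # finditer is used instead of findall because the pattern contains a
    # capture group, and findall would return only that group.
    return [m.group(0).decode("ascii") for m in PHONE_RE.finditer(data)]
```

Mixed separators such as xxx.xxx-xxxx are deliberately not matched here, because they do not appear in the list above.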
*************************************************************************************************************************
I wanted to post an update on the emails.
So my current script matches 46,322 emails (3,875 unique).
This uses only the characters a-zA-Z0-9._ as valid characters in the username and domain portions. An @ sign must also separate the two.
It also requires the TLD to be a 2 to 4 character a-zA-Z pattern.
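The rules described above can be sketched as a bytes regular expression. This is not the script that produced the counts above: the names EMAIL_RE and find_emails are hypothetical, and the trailing lookahead is an added assumption that stops a match from ending in the middle of a longer word.

```python
import re

EMAIL_RE = re.compile(
    rb"[A-Za-z0-9._]+"       # username: letters, digits, '.', '_'
    rb"@"
    rb"[A-Za-z0-9._]+"       # domain: same character set
    rb"\.[A-Za-z]{2,4}"      # TLD: 2 to 4 letters
    rb"(?![A-Za-z0-9_])"     # assumption: TLD is not followed by more word characters
)


def find_emails(data):
    """Return every email-address match in a bytes buffer as text."""
    return [m.group(0).decode("ascii") for m in EMAIL_RE.finditer(data)]
```

Total and unique counts can then be compared using len(matches) and len(set(matches)). Whether addresses that differ only in letter case should count as the same address is left open here.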
*************************************************************************************************************************
PYTHON REFERENCE MATERIAL
https://hackernoon.com/determining-file-format-using-python-c4e7b18d4fc4
https://python-forum.io/Thread-read-a-binary-file-to-find-its-type
https://stackoverflow.com/questions/1035340/reading-binary-file-and-looping-over-each-byte/1035360