Eron Fraud

Developed machine learning algorithms to predict person of interest for fraud detection based on a small and complicated Enron dataset.

Download .zip Download .tar.gz View on GitHub

Predicting Fraud at Enron: Who is Person of Interest?

Project report: IPython Notebook documenting the entire process. It includes more development scripts and work-in-progress output (charts and tables) than poi_id.py

poi_id.py: scripts for identifying POIs. Codes may not be in the exact same order as in the original file.

final_project_dataset.pkl: original dataset

Auxiliary codes:

  • feature_format.py
  • tester.py

Output from poi_id.py:

  • my_classifier.pkl
  • my_dataset.pkl
  • my_feature_list.pkl

Resource: include references and additional materials coming with final_project folder.

Reference

I hereby confirm that this submission is my work. I have cited above the origins of any parts of the submission that were taken from Websites, books, forums, blog posts, github repositories, etc. Status API Training Shop Blog About