CS-838
Projects for CS-760 Machine Learning (Spring 2018)
View on GitHub
Data Science Project - CS 838
Team Members
Aravind Soundararajan (soundararaj2@wisc.edu)
Krishnan Rajagopalan (krajagopalan@wisc.edu)
Palaniappan Nagarajan (pnagarajan3@wisc.edu)
Stage 1: Information extraction from natural text
Dataset
README
Set I
Set J
Source Code
Compressed File
Stage 1 Document
Stage 2: Crawling and extracting structured data from Web pages
Data
README
Code
Stage 2 Document
Stage 3: Entity matching
Data
Table A
Table B
README
Tuples that survived blocking
Labeled Tuples
Set I
Set J
Code
Stage 3 Document
Jupyter Notebook
Stage 4: Integrating and performing analysis
Data
Table E
Matches between Table A and Table B
Data Merging - Python Script
Stage 4 Document
Stage (Bonus): Deep Matcher
Deep Matcher