Resume-Filter
Extracting relevant information from resume using Deep Learning.
Getting Started
Installation
Steps: It is recommended to do the installation in anaconda virtual environment to avoid issues with dependencies. You can skip creating conda virtual environment(at your own risk!)
- Run
conda create --name resumefilter python=3.7.6 conda activate resumefiltergit clone https://github.com/0dust/ResumeFilter.gitcd ResumeFilterpip install -U setuptoolspip install -e .
Usage
Data Preparation
Getting training data is most challenging part due to lack of publicaly available dataset of resume. Currently, to create training data you will have to manually label the lines of resume.
1.Put the resume in data/training_data folder. Currently only .pdf and .docx format supported.
2.Run utils/create_training_data.py. A popup will be created. Annote the lines of resume in the same.
Training
3.Run start_training.py.
4.Trained model will be saved in trained_model folder.
Ready to use!
5.Put the resume to parse in data/resume_to_parse folder. Only .pdf and .docx format supported.
6.Run predict.py

Formed in 2009, the Archive Team (not to be confused with the archive.org Archive-It Team) is a rogue archivist collective dedicated to saving copies of rapidly dying or deleted websites for the sake of history and digital heritage. The group is 100% composed of volunteers and interested parties, and has expanded into a large amount of related projects for saving online and digital history.
