EducationalWeb

video introduction

The following instructions have been tested with Python 2.7 on Linux and macOS.

You should have ElasticSearch installed and running -- https://www.elastic.co/guide/en/elasticsearch/reference/current/targz.html
Create the index in ElasticSearch by running python create_es_index.py from EducationalWeb/
Download tfidf_outputs.zip from here -- https://drive.google.com/file/d/19ia7CqaHnW3KKxASbnfs2clqRIgdTFiw/view?usp=sharing

Unzip the file and place the folder under EducationalWeb/static
Download cs410.zip from here -- https://drive.google.com/file/d/1Xiw9oSavOOeJsy_SIiIxPf4aqsuyuuh6/view?usp=sharing

Unzip the file and place the folder under EducationalWeb/pdf.js/static/slides/
Run python scraper.py from CourseProject/crawling/ to scrape lecture slides from the website
Then run python parsePDF under EducationalWeb/pdf.js/ to normalize the slide names and split each PDF into a folder of single-slide files.
Run python getRelatedFiles.py in EducationalWeb/pdf.js/static to compute each slide's related slides with ranking scores
From EducationalWeb/pdf.js/build/generic/web, run the following command: gulp server
In another terminal window, run python app.py from EducationalWeb/
The site should be available at http://localhost:8096/
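
If the site does not come up, a quick preflight check can help narrow down which step was missed. The following is a minimal sketch, not part of the repository: the helper names missing_dirs and port_open are hypothetical, the directory list only covers paths that the steps above mention explicitly, and it assumes ElasticSearch is listening on its default HTTP port 9200. Run it from EducationalWeb/.

```python
from __future__ import print_function

import os
import socket

# Directories named in the steps above, relative to EducationalWeb/.
# The unzipped folder names are not listed because the steps do not state them.
REQUIRED_DIRS = [
    "static",
    os.path.join("pdf.js", "static", "slides"),
    os.path.join("pdf.js", "build", "generic", "web"),
]


def missing_dirs(root, required=REQUIRED_DIRS):
    """Return the entries of `required` that are not directories under `root`."""
    return [d for d in required if not os.path.isdir(os.path.join(root, d))]


def port_open(host, port, timeout=1.0):
    """Return True if a TCP connection to host:port succeeds within `timeout`."""
    try:
        conn = socket.create_connection((host, port), timeout)
    except (socket.error, socket.timeout):
        return False
    conn.close()
    return True


if __name__ == "__main__":
    for d in missing_dirs("."):
        print("missing directory:", d)
    # 9200 is ElasticSearch's default HTTP port (assumed unchanged here);
    # 8096 is the port the site is served on according to the steps above.
    for name, port in [("ElasticSearch", 9200), ("site", 8096)]:
        state = "reachable" if port_open("localhost", port) else "not reachable"
        print("%s on port %d: %s" % (name, port, state))
```

An open port only shows that something is listening there; it does not confirm that the index was created or that the slides were processed.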
