Functions scrape_lecture_video, get_js_soup: Scrape a website to collect all the text from large transcripts, enabling downstream processing.
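A minimal sketch of how these two functions might be structured, assuming Selenium (with chromedriver) and BeautifulSoup are installed; the function names match the ones above, but the internals here are illustrative:

```python
from selenium import webdriver
from bs4 import BeautifulSoup

def get_js_soup(url, driver):
    """Load a JavaScript-rendered page and return its parsed HTML."""
    driver.get(url)
    return BeautifulSoup(driver.page_source, "html.parser")

def scrape_lecture_video(url, driver):
    """Collect all visible text from a page for downstream processing."""
    soup = get_js_soup(url, driver)
    # Drop scripts and styles so only transcript-like text remains
    for tag in soup(["script", "style"]):
        tag.decompose()
    return soup.get_text(separator=" ", strip=True)
```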
Function tfidf: Produces the top keywords from the scraped lecture text, based on TF-IDF, so the top keywords can be displayed on the frontend.
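A minimal sketch of the TF-IDF step, assuming scikit-learn; the exact settings (stop words, number of keywords) are assumptions, not the project's actual parameters:

```python
from sklearn.feature_extraction.text import TfidfVectorizer

def tfidf(documents, top_n=10):
    """Return the top_n highest-scoring TF-IDF terms for each document."""
    vectorizer = TfidfVectorizer(stop_words="english")
    matrix = vectorizer.fit_transform(documents)
    terms = vectorizer.get_feature_names_out()
    keywords = []
    for row in matrix.toarray():
        # Sort term scores descending and keep the nonzero top_n
        top_idx = row.argsort()[::-1][:top_n]
        keywords.append([terms[i] for i in top_idx if row[i] > 0])
    return keywords
```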
With these functions, you can parse any body of text to quickly identify keywords. This comes in handy when you need to find more information on topics across multiple documents.
Ideally I wanted to run this directly against the Coursera videos, but that required more permissions to set up, so the example code here uses Wikipedia instead. The idea is the same: find keywords for lengthy articles and make them noticeable to readers. Since we can't edit public pages directly, the keywords are spun up in a separate HTML page (the generate_webpage.html file).
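A hypothetical sketch of how that keyword page could be generated; the actual generate_webpage.html in the repo may be built differently:

```python
def write_keyword_page(keywords, path="generate_webpage.html"):
    """Write a simple HTML page listing the extracted keywords."""
    items = "\n".join(f"    <li>{kw}</li>" for kw in keywords)
    html = (
        "<html>\n<head><title>Article Keywords</title></head>\n"
        f"<body>\n  <h1>Top Keywords</h1>\n  <ul>\n{items}\n  </ul>\n"
        "</body>\n</html>"
    )
    with open(path, "w") as f:
        f.write(html)
```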
chromedriver is used to extract text from websites, and the sklearn package identifies keywords from the extracted text.
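Putting the pieces together, using the sketches above (the Wikipedia URL is just an example target):

```python
from selenium import webdriver

driver = webdriver.Chrome()  # requires chromedriver on your PATH
text = scrape_lecture_video("https://en.wikipedia.org/wiki/Tf%E2%80%93idf", driver)
driver.quit()

top_keywords = tfidf([text], top_n=10)[0]
write_keyword_page(top_keywords)
```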
To install chromedriver, use the main website: https://chromedriver.chromium.org/getting-started. After that, see this helpful guide to make sure it's on your system PATH: https://www.kenst.com/2015/03/including-the-chromedriver-location-in-macos-system-path/.
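A quick sanity check that chromedriver is reachable from Python; if this raises a WebDriverException, revisit the PATH guide above:

```python
from selenium import webdriver

driver = webdriver.Chrome()
driver.get("https://www.google.com")
print(driver.title)  # should print "Google"
driver.quit()
```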
Single person team, so I did it all :)
Shared walkthrough of project here: https://drive.google.com/file/d/1Z_5aZqoJtUrOL7yy6AtaaTpPqtWW4ppK/view?usp=sharing