SE T38: Compulsory Task 1 and Compulsory Task 2
Compulsory Task 1 Follow these steps:
- Create a file called semantic.py and run all the code extracts above.
- Write a note about what you found interesting about the similarities between cat, monkey and banana and think of an example of your own.
- Run the example file with the simpler language model ‘en_core_web_sm’ and write a note on what you notice is different from the model 'en_core_web_md'.
- Host your solution on a Git host such as GitLab or GitHub.
- Remember to exclude any venv or virtualenv files from your repo.
- Add the link for your remote Git repo to a text file named semantic_similarity.txt
Compulsory Task 2 Let us build a system that will tell you what to watch next based on the word vector similarity of the description of movies.
- Create a file called watch_next.py
- Read in the movies.txt file. Each separate line is a description of a different movie.
- Your task is to create a function to return which movies a user would watch next if they have watched Planet Hulk with the description “Will he save their world or destroy it? When the Hulk becomes too dangerous for the Earth, the Illuminati trick Hulk into a shuttle and launch him into space to a planet where the Hulk can live in peace. Unfortunately, Hulk land on the planet Sakaar where he is sold into slavery and trained as a gladiator.”
- The function should take in the description as a parameter and return the title of the most similar movie.
- Host your solution on a Git host such as GitLab or GitHub.
- Remember to exclude any venv or virtualenv files from your repo.
- Add the link for your remote Git repo to your semantic_similarity.txt file.
- This project file run under Python development enviroment.
- OS: Windows 10
- Python Version: 3.7
- Download all files to your locally a special folder.
- Run Visual Studio Code first(If you haven't it then download it from Microsoft Website and install it first.)
- Open project files in Visual Studio Code.
- Runing py file that's fine.
Please contact to author if you have any advise for common problems or issues.
Weiguo Huang github.com/timhung1221
- 1.0
- Initial Release
This project is licensed under the [Weiguo Huang] License.