Turning in my Shakespeare Shtuffs! :D by kwinter213 · Pull Request #10 · sd16fall/TextMining

kwinter213 · 2016-10-03T19:21:25Z

No description provided.

poosomooso

Awesome project overall! It's a cool concept and looks like fun. I put a lot of comments here but they're mostly pretty surface level; your code looks mostly great. When doing revisions, if you have any questions about my comments, I'd be happy to clarify, since writing is difficult.

poosomooso · 2016-10-05T20:11:15Z

MiniProject1.py

+#fo = open("ShakespeareSonnets.txt", "r")
+with open('ShakespeareSonnets.txt', 'r') as f:
+	read_data = f.read()
+f.closed


What is this line for? Are you trying to ensure that the file is closed? There are interesting error throwing ways of doing this, but you probably don't need this line, since the 'with' keyword and the open method together handle file closing.

Some website told me that it saves memory or something, but I can take it out lol

poosomooso · 2016-10-05T20:13:47Z

MiniProject1.py

+	read_data = f.read()
+f.closed
+sonnetz= read_data.split('\n\n')
+sonnetLine=[0, 1, 2, 3,4, 5,6 ,7, 8, 9, 10, 11, 12, 13]


A comment explaining what this line is for would be useful. In all fairness, you do a pretty good job of explaining your code with comments for most of the rest of everything :^)

Comment on your comment: It's adorable that your emoji has a nose!

poosomooso · 2016-10-05T20:14:38Z

MiniProject1.py

+f.closed
+sonnetz= read_data.split('\n\n')
+sonnetLine=[0, 1, 2, 3,4, 5,6 ,7, 8, 9, 10, 11, 12, 13]
+import random


A little thing, but try to keep all your import statements at the top. That way, someone just casually reading through your code will know what libraries this code needs right away.

poosomooso · 2016-10-05T20:17:28Z

MiniProject1.py

+for k in range(0,len(sonnetz)-1):
+	lines=sonnetz[k].split('\n') #splits into lines
+
+	def DictRhymes():


Nice job modularizing/functionizing your code! So clean. I would recommend putting docstrings or some sort of comment under the 'def ' line, just saying what the method does, what parameters it uses, and what it returns (if it returns). Also, typically you want to have your methods outside of for loops, so they can be more generalized.

If there's a method only defined inside of a for loop, that's awful coding hygiene and I'm deeply ashamed and going to fix that, cuz my mama didn't raise me to write code like that.

poosomooso · 2016-10-05T20:27:41Z

MiniProject1.py

+	def getRhyme(num1, num2): #num1 and num2 inputs represent which lines are selected (rhyming lines)
+		lastword1=lines[num1].split(' ')[-1] #takes the last word of every other line
+		lastword2=lines[num2].split(' ')[-1]
+		if(sentiment(lastword1)[0]<=.25 and sentiment(lastword2)[0]<=.25):


Try not to use 'magic numbers' such as .25. Instead, make a variable whose name explains why the number is significant (for example, 'depressing_sentiment = .25'). This goes for pretty much any number where it's not completely obvious your intentions (like 0 or -1 are not magic numbers). An exception would be when you call 'getRhyme' 6 times--you probably shouldn't make variables for all those line numbers. However, you should probably comment there why those numbers you chose are significant (ie. A sonnet rhyme scheme always rhymes these specific line numbers)

Also in the case of the .25, having it as a variable at the top of your file or something would be really useful, so if you ever wanted to change it to .5 or something, you only have to change one variable, instead of the three times you use that number

Low key I'm cutting this entire part out of my code casually...

poosomooso · 2016-10-05T20:39:56Z

MiniProject1.py

+
+	def getMarkLine(num1): #num1 is the line that is getting processed
+		splitLine=lines[num1].split(' ')
+		for i in range (1, len(splitLine)):


This is a really cool way of writing a loop that goes backwards that I honestly have never thought of before....

Thanks bae <3

poosomooso · 2016-10-05T20:57:19Z

MiniProject1.py

+
+def genLine(line1, line2):
+	startLine1=random.choice(rhymeWords.keys()) #picks a random rhyming word**'
+	startLine2=rhymeWords[startLine1][random.randint(0,len(rhymeWords[startLine1])-1)] #picks a random rhyme to rhyme with the first rhyme


I'm pretty sure random.choice works on a list, and rhymeWords[startLine1] is a list. Just a thought to makes this line slightly more readable and shorter. This also goes for when you are trying to pick random previous words a little farther down. But what you have here definitely works.

I'm not sure that I get this one!

kwinter213 · 2016-10-05T21:31:50Z

Thanks bae <3 <3
How do I fix this exactly though???

From: Serena Chen [mailto:notifications@github.com]
Sent: Wednesday, October 05, 2016 5:01 PM
To: sd16fall/TextMining TextMining@noreply.github.com
Cc: Kimberly Winter Kimberly.Winter@students.olin.edu; Author author@noreply.github.com
Subject: Re: [sd16fall/TextMining] Turning in my Shakespeare Shtuffs! :D (#10)

@poosomooso commented on this pull request.

Awesome project overall! It's a cool concept and looks like fun. I put a lot of comments here but they're mostly pretty surface level; your code looks mostly great. When doing revisions, if you have any questions about my comments, I'd be happy to clarify, since writing is difficult.

In MiniProject1.pyhttps://github.com//pull/10#pullrequestreview-2993400:

@@ -0,0 +1,96 @@

+# Kimber's file

+from pattern.en import *

+rhymeWords=dict() #dictionary of words that rhyme

+generalWords=dict() #markov chain dictionary

+#fo = open("ShakespeareSonnets.txt", "r")

+with open('ShakespeareSonnets.txt', 'r') as f:

```
  read_data = f.read()
```

+f.closed

What is this line for? Are you trying to ensure that the file is closed? There are interesting error throwing ways of doing this, but you probably don't need this line, since the 'with' keyword and the open method together handle file closing.

In MiniProject1.pyhttps://github.com//pull/10#pullrequestreview-2993400:

@@ -0,0 +1,96 @@

+# Kimber's file

+from pattern.en import *

+rhymeWords=dict() #dictionary of words that rhyme

+generalWords=dict() #markov chain dictionary

+#fo = open("ShakespeareSonnets.txt", "r")

+with open('ShakespeareSonnets.txt', 'r') as f:

```
  read_data = f.read()
```

+f.closed

+sonnetz= read_data.split('\n\n')

+sonnetLine=[0, 1, 2, 3,4, 5,6 ,7, 8, 9, 10, 11, 12, 13]

A comment explaining what this line is for would be useful. In all fairness, you do a pretty good job of explaining your code with comments for most of the rest of everything :^)

In MiniProject1.pyhttps://github.com//pull/10#pullrequestreview-2993400:

@@ -0,0 +1,96 @@

+# Kimber's file

+from pattern.en import *

+rhymeWords=dict() #dictionary of words that rhyme

+generalWords=dict() #markov chain dictionary

+#fo = open("ShakespeareSonnets.txt", "r")

+with open('ShakespeareSonnets.txt', 'r') as f:

```
  read_data = f.read()
```

+f.closed

+sonnetz= read_data.split('\n\n')

+sonnetLine=[0, 1, 2, 3,4, 5,6 ,7, 8, 9, 10, 11, 12, 13]

+import random

A little thing, but try to keep all your import statements at the top. That way, someone just casually reading through your code will know what libraries this code needs right away.

In MiniProject1.pyhttps://github.com//pull/10#pullrequestreview-2993400:

+# Kimber's file

+from pattern.en import *

+rhymeWords=dict() #dictionary of words that rhyme

+generalWords=dict() #markov chain dictionary

+#fo = open("ShakespeareSonnets.txt", "r")

+with open('ShakespeareSonnets.txt', 'r') as f:

```
  read_data = f.read()
```

+f.closed

+sonnetz= read_data.split('\n\n')

+sonnetLine=[0, 1, 2, 3,4, 5,6 ,7, 8, 9, 10, 11, 12, 13]

+import random

+for k in range(0,len(sonnetz)-1):

  lines=sonnetz[k].split('\n') #splits into lines

```
  def DictRhymes():
```

Nice job modularizing/functionizing your code! So clean. I would recommend putting docstrings or some sort of comment under the 'def ' line, just saying what the method does, what parameters it uses, and what it returns (if it returns). Also, typically you want to have your methods outside of for loops, so they can be more generalized.

In MiniProject1.pyhttps://github.com//pull/10#pullrequestreview-2993400:

+for k in range(0,len(sonnetz)-1):

  lines=sonnetz[k].split('\n') #splits into lines

```
  def DictRhymes():
```

         getRhyme(0, 2) #English sonnet rhyme scheme (lines that rhyme)

```
         getRhyme(1, 3)
```
```
         getRhyme(4, 6)
```
```
         getRhyme(5, 7)
```
```
         getRhyme(8,10)
```
```
         getRhyme(9,11)
```

  def getRhyme(num1, num2): #num1 and num2 inputs represent which lines are selected (rhyming lines)

         lastword1=lines[num1].split(' ')[-1] #takes the last word of every other line

         lastword2=lines[num2].split(' ')[-1]

         if(sentiment(lastword1)[0]<=.25 and sentiment(lastword2)[0]<=.25):

Try not to use 'magic numbers' such as .25. Instead, make a variable whose name explains why the number is significant (for example, 'depressing_sentiment = .25'). This goes for pretty much any number where it's not completely obvious your intentions (like 0 or -1 are not magic numbers). An exception would be when you call 'getRhyme' 6 times--you probably shouldn't make variables for all those line numbers. However, you should probably comment there why those numbers you chose are significant (ie. A sonnet rhyme scheme always rhymes these specific line numbers)

In MiniProject1.pyhttps://github.com//pull/10#pullrequestreview-2993400:

```
               else:
```

                         rhymeWords[''.join(lastword1)]=[''.join(lastword2)] #otherwise makes a new list

                 if lastword2 in rhymeWords:

                         rhymeWords[lastword2].append(''.join(lastword1))

```
                 else:
```

                         rhymeWords[lastword2]=[lastword1]

```
  DictRhymes()
```

  def getMarkov(): #backwards Markov b/c the end of the line is selected first

```
         for i in range(0,12):
```
```
                 getMarkLine(i)
```

  def getMarkLine(num1): #num1 is the line that is getting processed

         splitLine=lines[num1].split(' ')

         for i in range (1, len(splitLine)):

This is a really cool way of writing a loop that goes backwards that I honestly have never thought of before....

In MiniProject1.pyhttps://github.com//pull/10#pullrequestreview-2993400:

+for k in range(0,len(sonnetz)-1):

  lines=sonnetz[k].split('\n') #splits into lines

```
  def DictRhymes():
```

         getRhyme(0, 2) #English sonnet rhyme scheme (lines that rhyme)

```
         getRhyme(1, 3)
```
```
         getRhyme(4, 6)
```
```
         getRhyme(5, 7)
```
```
         getRhyme(8,10)
```
```
         getRhyme(9,11)
```

  def getRhyme(num1, num2): #num1 and num2 inputs represent which lines are selected (rhyming lines)

         lastword1=lines[num1].split(' ')[-1] #takes the last word of every other line

         lastword2=lines[num2].split(' ')[-1]

         if(sentiment(lastword1)[0]<=.25 and sentiment(lastword2)[0]<=.25):

Also in the case of the .25, having it as a variable at the top of your file or something would be really useful, so if you ever wanted to change it to .5 or something, you only have to change one variable, instead of the three times you use that number

In MiniProject1.pyhttps://github.com//pull/10#pullrequestreview-2993400:

       for i in range (1, len(splitLine)):

                 curWord= splitLine[-i]

                 prevWord=splitLine[-i-1]

                 if(sentiment(curWord)[0]<=.25):

                         if curWord in generalWords: #checks for existing list

                                generalWords[curWord].append(''.join(prevWord)) #extends list if need be

```
                         else:
```

                                generalWords[''.join(curWord)]=[''.join(prevWord)] #otherwise makes a new list

```
  getMarkov()
```

+def genLine(line1, line2):

  startLine1=random.choice(rhymeWords.keys()) #picks a random rhyming word**'

  startLine2=rhymeWords[startLine1][random.randint(0,len(rhymeWords[startLine1])-1)] #picks a random rhyme to rhyme with the first rhyme

I'm pretty sure random.choice works on a list, and rhymeWords[startLine1] is a list. Just a thought to makes this line slightly more readable and shorter. This also goes for when you are trying to pick random previous words a little farther down. But what you have here definitely works.

—
You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHubhttps://github.com//pull/10#pullrequestreview-2993400, or mute the threadhttps://github.com/notifications/unsubscribe-auth/APGixMzII2d7pu17Z_NjTaEnZieNIfQoks5qxA_ugaJpZM4KM8LA.

poosomooso · 2016-10-05T22:00:29Z

@kwinter213 Just make your changes in your forked version, and then make another pull request! Revisions are due on Monday

poosomooso · 2016-10-23T22:31:04Z

Hey Kimber! Some comments.

Did you test your code after making the revisions? There are a couple of bugs, mostly from variables that don't exist in certain scopes.

The first thing is that functions need to be defined before you call them. Specifically in the example where you have the for loop starting in line 16:

for k in range(0,len(sonnetz)-1):
lines=sonnetz[k].split('\n') #splits into lines
DictRhymes()
getMarkov()

That throws an error because you have your 'def DictRhymes' and 'def getMarkov' afterwards. It's a problem with how python is interpreted.

Another variables issue that I see is that the getRhyme function (and markLine) is trying to get sonnet lines from the list 'lines'. However, lines doesn't exist! I assume you use 'lines' because you copied the method from the above for loop to outside the for loop. However, you need to pass in the list of sonnet lines you want to look at, or make lines a variable in the very outside scope (outside the for loop). I prefer the former, because then, when you call DictRhymes and getRhyme, it's not dependent on the list 'lines' being up to date, so it's more generalized. I don't know if this paragraph makes any sense; if it doesn't I'm happy to talk about it in person.

I think I found your bug in the rhyming words. You have 'split('\n\n')', which splits at every two lines endings. But If you have two empty lines, you actually have three new lines--one at the end of the last line of the previous sonnet, one for the first empty line, and one for the second empty line. Since you were splitting on two new-line characters (the \n), it kept the second empty line at the beginning of the next sonnet, so all your lines were off by one. Tricky bug :/

One more thing--you have great comments, but I want to suggest using docstrings instead. For example, instead of:

def DictRhymes(lines):
#This method decides which lines rhyme (and therefore which should go in the dictionary)
#void function

do:

def DictRhymes(lines):
"""his method decides which lines rhyme (and therefore which should go in the dictionary)
void function"""

Which is more of a standard python docstring. Doesn't seem super important, but it's a standard, and people reading your code will be looking for those docstrings. Also, doctests work in them, so that's nice.

But overall, awesome project!

kwinter213 added 2 commits October 3, 2016 01:14

Turning in my Shakespeare Shtuffsgit status!

146c3a7

Soft Des Project

ff88327

poosomooso reviewed Oct 5, 2016

View reviewed changes

initial commit

c53db28

Conversation

kwinter213 commented Oct 3, 2016

Uh oh!

poosomooso left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

kwinter213 commented Oct 5, 2016

Uh oh!

poosomooso commented Oct 5, 2016

Uh oh!

poosomooso commented Oct 23, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants