Fix: Replace polyglot with NLTK alternatives to resolve HTTP 403 error #65
Summary
This PR resolves the HTTP 403 error that occurs when running ch3/3_1_wordsteam.py by replacing the polyglot library dependency with NLTK-based alternatives. The polyglot download server (http://polyglot.cs.stonybrook.edu/~polyglot/) is permanently down or misconfigured and returns 403 Forbidden when the script tries to download morpheme analyzer resources.
Changes made:
- Removed the dependency on polyglot.text.Word
- Replaced the polyglot_stem() function with nltk_alternative_stem(), which uses NLTK's Porter, Lancaster, and Snowball stemmers
Root cause: The polyglot library's server infrastructure is no longer accessible, causing runtime failures. This is a known issue affecting many users (see GitHub issues #204, #282, etc.).
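
For reviewers who want a picture of the stemmer swap described under "Changes made", here is a minimal sketch. It is not the code in this PR: the function name nltk_alternative_stem() comes from the description, but its signature, return shape, and the sample words are assumptions made for illustration. The three stemmer classes are standard NLTK and need no corpus download, so nothing here contacts the polyglot server.

```python
from nltk.stem import LancasterStemmer, PorterStemmer, SnowballStemmer


def nltk_alternative_stem(words):
    """Stem each word with three NLTK stemmers.

    Hypothetical signature: the real function in the PR may take
    different arguments or print instead of returning a value.
    Returns a dict of the form {word: {stemmer_name: stem}}.
    """
    stemmers = {
        "porter": PorterStemmer(),
        "lancaster": LancasterStemmer(),
        # The Snowball stemmer must be told which language to use.
        "snowball": SnowballStemmer("english"),
    }
    return {
        word: {name: stemmer.stem(word) for name, stemmer in stemmers.items()}
        for word in words
    }


if __name__ == "__main__":
    # Sample words are illustrative only.
    sample_words = ["running", "cats", "stemming"]
    for word, stems in nltk_alternative_stem(sample_words).items():
        print(word, stems)
```

Returning all three results side by side makes it easy to compare how aggressively each stemmer truncates a word. Note that these are rule-based stemmers, so their output will not necessarily match polyglot's morpheme analysis.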
Review & Testing Checklist for Human
- python3 ch3/3_1_wordsteam.py
- pip install nltk
Notes
Session URL: https://app.devin.ai/sessions/c36fa9675f274b57b1138a74ecbd8a7f
Requested by: jalajthanaki@gmail.com (@jalajthanaki)