Homework
HW0: Getting Started
Last updated
Was this helpful?
HW0: Getting Started
Last updated
Was this helpful?
In this assignment, you will:
Set up your development environment,
Install your first Python package using pip within a virtual environment,
Run a test program to verify the installation and environment configuration, and
Commit your changes and push them to your GitHub repository.
This will ensure your development workspace is properly configured for this course.
Once you set up the development environment:
Open Terminal in PyCharm:
Click the Terminal icon () at the bottom left, or
Select the [View] > [Tool Windows] > [Terminal]
menu).
Update pip to the latest version (if necessary):
Install setuptools (if necessary):
Install the ELIT Tokenizer:
You will know the installation is successful when you see "Successfully installed ..." messages for each package in the terminal output.
Create your first program:
Create a Python file called getting_started.py inside homework
.
Copy the following code into the file:
Run the program:
Choose the [Run] > [Run 'getting_started']
menu, or
Use the green run button next to the main block.
Verify the output; your program is working correctly if you see this output:
Create a .gitignore file:
Create the file in your nlp-essentials root directory
Add the following lines to exclude unnecessary files:
Stage your files for commit:
Add the following files to Git by right-clicking them and selecting [Git] > [Add]
:
Files should turn green when successfully added. If files do not change color, restart PyCharm and try again.
Commit and push your changes:
Right-click the nlp-essentials directory.
Select [Git] > [Commit Directory]
.
Write a descriptive commit message (e.g., "Initial setup and tokenizer test")
Click [Commit and Push]
(not just Commit
)
Click [Push]
in the next dialog to upload to your GitHub repository.
Verify your submission:
Visit your GitHub repository in a web browser.
Confirm that all files are properly present and contain the correct content.
Submit the URL of your GitHub repository to Canvas.
Share your team project concept by filling out the form in Canvas (about 100-150 words). Your description will be posted on the Project Ideas page to help classmates discover shared interests and form teams.
GitHub Setup (0.2 points):
Private repository created.
All instructors added as collaborators.
Project Organization (0.2 points):
Correct directory structure.
No unnecessary files committed
Version Control (0.3 points):
All required files committed and successfully pushed to GitHub.
Content of the files are correct.
Code Implementation (0.3 points):
The program executes without errors.
Produces correct tokenizer output.
Project Ideas (1 point)
Is the team project idea well described?