NLP Essentials
GitHub Author
  • Overview
    • Syllabus
    • Schedule
    • Development Environment
    • Homework
  • Text Processing
    • Frequency Analysis
    • Tokenization
    • Lemmatization
    • Regular Expressions
    • Homework
  • Language Models
    • N-gram Models
    • Smoothing
    • Maximum Likelihood Estimation
    • Entropy and Perplexity
    • Homework
  • Vector Space Models
    • Bag-of-Words Model
    • Term Weighting
    • Document Similarity
    • Document Classification
    • Homework
  • Distributional Semantics
    • Distributional Hypothesis
    • Word Representations
    • Latent Semantic Analysis
    • Neural Networks
    • Word2Vec
    • Homework
  • Contextual Encoding
    • Subword Tokenization
    • Recurrent Neural Networks
    • Transformer
    • Encoder-Decoder Framework
    • Homework
  • NLP Tasks & Applications
    • Text Classification
    • Sequence Tagging
    • Structure Parsing
    • Relation Extraction
    • Question Answering
    • Machine Translation
    • Text Summarization
    • Dialogue Management
    • Homework
  • Projects
    • Speed Dating
    • Team Formation
    • Proposal Pitch
    • Proposal Report
    • Live Demonstration
    • Final Report
    • Team Projects
      • Team Projects (2024)
    • Project Ideas
      • Project Ideas (2024)
Powered by GitBook

Copyright © 2023 All rights reserved

On this page

Was this helpful?

Export as PDF
  1. Distributional Semantics

Distributional Hypothesis

PreviousDistributional SemanticsNextWord Representations

Last updated 1 month ago

Was this helpful?

The distributional hypothesis suggests that words that occur in similar contexts tend to have similar meanings [1]. Let us examine the following two sentences with blanks:

A: I sat on a __

B: I played with my __

The blank for A can be filled with words such as {bench, chair, sofa, stool}, which convey the meaning "something to sit on" in this context. On the other hand, the blank for B can be filled with words such as {child, dog, friend, toy}, carrying the meaning of "someone/thing to play with." However, these two sets of words are not interchangeable, as it is unlikely that you would sit on a "friend" or play with a "sofa".

This hypothesis provides a potent framework for understanding how meaning is encoded in language and has become a cornerstone of modern computational linguistics and natural language processing.

Q1: Assuming that your corpus has only the following three sentences, what context would influence the meaning of the word "chair" according to the distributional hypothesis?

  1. I sat on a chair.

  2. I will chair the meeting.

  3. I am the chair of my department.

References

  1. , Zellig S. Harris, Word, 10 (2-3): 146-162, 1954.

Distributional Structure