Skip to content

googleinterns/localizing-paragraph-memorization

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Localizing and Controlling Paragraph Level Recitation

Explores how small open source LMs like GPT-Neo implement paragraph level recitation from the training data. Includes helper scripts and exploratory Jupyter notebooks. Work done by Niklas Stoehr during his winter 2023 research internship.

Not an official google project

Project Structure

utils


helper scripts with basic functionality that is used in different notebooks

  • patching
  • evaluation
  • dataLoaders
  • gradient
  • intervening
  • localizing
  • modelHandlers

notebooks


notebooks to reproduce the main experiments

1 descriptive

    - explorative
    - token pertubation 

2 localizing

    - activation patching
    - gradient-based attribution
            - parameter gradients
            - activation gradients
    - attention head analysis

3 editing

paragraphs


CSV file of some paragraphs that are memorized by GPT-neo-125M

About

No description, website, or topics provided.

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published