The structured state space neural Turing machine (S4D-NTM) is a neural network that combines an NTM-style external memory with the diagonal structured state space model (S4D). The S4D layer's structured state-space recurrence handles long-range dependencies in sequences, and its hidden state is written to the memory bank for later retrieval. In short, the S4D-NTM is a variant of the NTM that replaces the controller's recurrence with structured state-space matrices to improve performance on tasks requiring long-range dependencies.
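The core idea above can be illustrated with a minimal, self-contained sketch: a diagonal state-space (S4D-style) recurrence produces a hidden state at each step, and that state is written to an external memory bank for later content-based retrieval. All names, shapes, and the read mechanism below are illustrative assumptions, not the repo's actual code.

```python
import numpy as np

rng = np.random.default_rng(0)

N, D = 8, 4   # state size, input/output size (hypothetical)
dt = 0.1      # step size for ZOH discretization

# S4D uses a *diagonal* state matrix A; here stable negative real diagonals.
A = -np.linspace(0.5, 4.0, N)           # (N,) diagonal of A
B = rng.standard_normal((N, D)) * 0.1
C = rng.standard_normal((D, N)) * 0.1

# Zero-order-hold discretization, elementwise because A is diagonal.
A_bar = np.exp(A * dt)                       # (N,)
B_bar = ((A_bar - 1.0) / A)[:, None] * B     # (N, D)

def s4d_ntm_step(x, u, memory):
    """One recurrent step: update the SSM state and write it to memory."""
    x = A_bar * x + B_bar @ u    # diagonal SSM recurrence
    y = C @ x                    # readout
    memory.append(x.copy())      # write hidden state to the memory bank
    return x, y

def read_memory(memory, query):
    """Content-based read: cosine-similarity softmax over stored states."""
    M = np.stack(memory)                                            # (T, N)
    sims = M @ query / (np.linalg.norm(M, axis=1)
                        * np.linalg.norm(query) + 1e-8)
    w = np.exp(sims) / np.exp(sims).sum()                           # weights
    return w @ M                                                    # (N,)

# Run over a short random sequence, then retrieve from memory.
x, memory = np.zeros(N), []
for t in range(16):
    u = rng.standard_normal(D)
    x, y = s4d_ntm_step(x, u, memory)

retrieved = read_memory(memory, x)
print(retrieved.shape)   # (8,)
```

The real model learns A, B, C (and the read/write heads) end to end; this sketch only shows how the S4D state and the memory bank interact.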
Credit: code adapted from https://github.com/loudinthecloud/pytorch-ntm
- Understand the structured state space neural Turing machine (S4D-NTM) and how it builds on the S4/S4D model.
- Implement the S4D-NTM model in PyTorch.
- Train the model on a simple task to verify that it works.
- Baseline plain NTM model.
  - Implement the copy task.
  - Implement the sequential (seq-MNIST) task.
- S4D-NTM.
  - Implement the copy task.
  - Implement the sequential (seq-MNIST) task.
- LSTM on the same tasks (possibly using the reference chart from the S4 paper).
  - Implement the copy task.
  - Implement the sequential (seq-MNIST) task.
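For the copy task listed above, the standard NTM benchmark setup is: the model sees a random binary sequence followed by a delimiter, then must reproduce the sequence from memory. A hedged sketch of the data generation (names and shapes are illustrative assumptions, not the repo's exact API):

```python
import numpy as np

def make_copy_batch(batch_size, seq_len, width, rng):
    """Return (inputs, targets) for one copy-task batch.

    inputs:  (2*seq_len + 1, batch_size, width + 1) -- the sequence, a
             delimiter step, then zeros while the model emits its answer.
    targets: (seq_len, batch_size, width) -- the sequence to reproduce.
    """
    seq = rng.integers(0, 2, size=(seq_len, batch_size, width)).astype(np.float32)
    inputs = np.zeros((2 * seq_len + 1, batch_size, width + 1), dtype=np.float32)
    inputs[:seq_len, :, :width] = seq
    inputs[seq_len, :, width] = 1.0   # delimiter flag on the extra channel
    return inputs, seq

rng = np.random.default_rng(0)
x, y = make_copy_batch(batch_size=4, seq_len=5, width=8, rng=rng)
print(x.shape, y.shape)   # (11, 4, 9) (5, 4, 8)
```

The loss is then binary cross-entropy between the targets and the model's outputs over the final `seq_len` steps.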
- Baseline plain NTM model.
- NTM-S4D, 256-seq, 5 epochs
- NTM-S4D, 784-seq, 5 epochs (checkpoints/2024-07-0211-36-16/seq-mnist-ntm-s4d--seed-1000-epoch-5-batch-5399-2024-07-0211-36-16.json)
- S4D, 256-seq, 5 epochs
- S4D, 784-seq, 5 epochs
Train the models using the following commands:
```shell
python train.py --task seq-mnist-ntm --checkpoint_interval 20 --report_interval 10 -pbatch_size=64 --epochs=1 --validation_interval=0 --seed 1000
...
python train.py --task seq-mnist-ntm --checkpoint_interval 100 --report_interval 100 -pbatch_size=10 -puse_memory=1 --epochs=3 --validation_interval=1 --seed 1000
...
python train.py --task seq-mnist-lstm --checkpoint_interval 20 --report_interval 10 -pbatch_size=64 --epochs=100 --validation_interval=0 --seed 1000
```