This repository contains vector embeddings for the EchoNet dataset and the trained model that extracted them. It is intended as a resource for future AI researchers in Cardiology.
In the base directory there is one file containing the best 400-dimensional embeddings learned in the paper
echonet_embeddings.txt: 400-dimensional embeddings of the 10,030 echocardiogram videos from the EchoNet dataset, learned from the R3D transformer model we developed in our paper. Each embedding vector is identifiable by the video ID of the echocardiogram it represents. Those video IDs and the entire EchoNet dataset can be found at this link: https://stanfordaimi.azurewebsites.net/datasets/834e1cd1-92f7-4268-9daa-d359198b310a
This repository also contains our trained model, embedder/r3d_binary_111723.pt. Here's how to generate your own echo embeddings using it:
- Navigate to your workspace or desktop
git clone git@github.com:Team-Echo-MIT/r3d-v0-embeddings.git- clone this repository into your workspace- Navigate to the cloned repository in your workspace
- Put echocardiograms to embed as .avi files in the
embedder/echossubdirectory - Run the extraction script in the terminal:
python generate_echo_embeddings.py <your-path>/embedder/echos <your-path>/embedder/r3d_binary_111723.pt <your-path>/embedder/tensor_board(replace<your-path>with the path leading to the embedder directory in your workspace) - Embeddings will be written to a txt file in the
embedder/embeddingssubdirectory (this may take a few minutes)
R3D- name of the transformer model used to learn the embeddingsV0- version of the model (this is the first version of our model)embeddings- our focus is on the embeddings that the model learned for each echocardiogram
Contributors:
Daniel Chung
djaechung
Mindy Somin Leemindyslee
Vasu KakerVasuKaker
Yongyi Zhao