Skip to content

Fix the attention module embedding size#5272

Merged
vincentpierre merged 2 commits into
mainfrom
develop-fix-attn-embedding
Apr 15, 2021
Merged

Fix the attention module embedding size#5272
vincentpierre merged 2 commits into
mainfrom
develop-fix-attn-embedding

Conversation

@vincentpierre

@vincentpierre vincentpierre commented Apr 15, 2021

Copy link
Copy Markdown
Contributor

Proposed change(s)

The embedding size of attention layers is now fixed to 128 instead of being the same as embedding size.

Need to modify changelog
Need to run Sorter once and see if it solves correctly.

Useful links (Github issues, JIRA tickets, ML-Agents forum threads etc.)

Types of change(s)

  • Bug fix
  • New feature
  • Code refactor
  • Breaking change
  • Documentation update
  • Other (please describe)

Checklist

  • Added tests that prove my fix is effective or that my feature works
  • Updated the changelog (if applicable)
  • Updated the documentation (if applicable)
  • Updated the migration guide (if applicable)

Other comments

@vincentpierre vincentpierre self-assigned this Apr 15, 2021
@ervteng

ervteng commented Apr 15, 2021

Copy link
Copy Markdown
Contributor

Can we add a short blurb in the changelog? Since it may result in e.g. models not loading between R16 and R17, slight changes in behavior

@vincentpierre vincentpierre requested a review from andrewcoh April 15, 2021 22:16
@vincentpierre

Copy link
Copy Markdown
Contributor Author

@andrewcoh I am currently running Sorter to make sure it works. Can I have your eyes on the implementation to see if it makes sense?

@vincentpierre vincentpierre requested a review from ervteng April 15, 2021 22:17
@vincentpierre vincentpierre merged commit 9ae2c28 into main Apr 15, 2021
@github-actions github-actions Bot locked as resolved and limited conversation to collaborators Apr 16, 2022
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants