Skip to content

[Feature] Add GVA support for Lightning #81

@icavan

Description

@icavan

Similar to #55

  • Support HV > H (num_v_heads > num_qk_heads) in KDA, following the gated_delta_rule GVA pattern
  • Add corresponding test and benchmark config

Metadata

Metadata

Assignees

Labels

Type

No type
No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions