Quartz 4

Tag: transformers

6 items with this tag.

  • Apr 21, 2026

    Different versions of low-rank adaptation have equivalent performance after controlling for learning rate

    • transformers
    • low-rank-adaptation
  • Apr 21, 2026

    Encoder- and decoder-based models have different strengths and are equally useful on generative tasks

    • transformers
  • Apr 21, 2026

    On fixed compute budgets, mixture-of-experts models outperform dense models

    • transformers
  • Apr 21, 2026

    Poisoning attacks on LMs require a constant number of samples regardless of scale

    • transformers
  • Apr 21, 2026

    Pretraining performance does not capture effectiveness on downstream tasks

    • transformers
  • Apr 20, 2026

    Disentangled attention

    • transformers

Created with Quartz v4.5.2 © 2026

  • GitHub
  • Discord Community