
    1. Exploiting Student Parallelism for Low-latency GPU Inference of BERT-like Models in Online Services

    Wang, Weiyan, Jin, Y ...

    Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining[2154-817X], Published 2025, Volume 2, Pages 3055-3066

    Indexed in: SCOPUS

