暂无图片
暂无图片
暂无图片
暂无图片
暂无图片
一种自主设计的面向E级高性能计算的异构融合加速器 - 刘胜, 卢凯, 郭阳, 刘仲, 陈海燕, 雷元武, 孙海燕, 杨乾明, 陈小文, 陈胜刚, 刘必慰, 鲁建壮.pdf
163
4页
1次
2021-11-16
免费下载
DOI
:
issn
JournalofCom
p
uterResearchandDevelo
p
ment
(
):
,
 
稿
:
;
:
 
:
(
YFB
)
Thisworkwassu
pp
ortedb
y
theNationalKe
y
ResearchandDevelo
p
mentPro
g
ramofChina
(
YFBsub
p
ro
j
ectI
)
E
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
(
 
 
)
(
liushen
g
nudteducn
)
ASelfDesi
g
nedHetero
g
eneousAcceleratorforExascaleHi
g
hPerformanceCom
p
utin
g
LiuShen
g
,
LuKai
,
GuoYan
g
,
LiuZhon
g
,
Chen Hai
y
an
,
LeiYuanwu
,
Sun Hai
y
an
,
Yan
g
Qianmin
g
,
ChenXiaowen
,
ChenShen
gg
an
g
,
LiuBiwei
,
andLuJianzhuan
g
(
Colle
g
eo
f
Com
p
uterScienceandTechnolo
gy
,
NationalUniversit
y
o
f
De
f
enseTechnolo
gy
,
Chan
g
sha
)
Abstract Hi
g
h
p
erformancecom
p
utin
g
(
HPC
)
isoneofthebasicfieldsto
p
romotethedevelo
p
ment
ofscienceandtechnolo
gy
Exascale HPCera
,
reco
g
nizedas
thenextcrownofsu
p
ercom
p
uter
,
is
comin
g
Theacceleratorfieldforexascale HPChas
g
raduall
y
develo
p
edintothearenaofthe most
hi
g
hendchi
p
sintheworldTheinternationalfamouscom
p
anies
,
suchasAMD
,
NVIDIAandIntel
,
haveoccu
p
iedthisfieldforseveral
y
earsAsoneoftheor
g
anizationswhichinde
p
endentl
y
desi
g
ned
p
rocessorsinChina
,
NationalUniversit
y
ofDefenseTechnolo
gy
(
NUDT
)
hasalwa
y
sbeenastron
g
com
p
etitorinHPCacceleratorfieldThis
p
a
p
erintroducesanacceleratorforexascaleHPC whichis
selfdesi
g
nedb
y
NUDTItado
p
tsahetero
g
eneousarchitecturewithCPUand
g
eneral
p
ur
p
osedi
g
ital
si
g
nal
p
rocessor
(
GPDSP
)
Ithasthecharacteristicsofhi
g
h
p
erformance
,
hi
g
hefficienc
y
andhi
g
h
p
ro
g
rammabilit
y
,
andisex
p
ectedtobetheke
y
com
p
utin
g
chi
p
ofournewexascalesu
p
ercom
p
uter
s
y
stem.
Ke
y
words hi
g
h
p
erformancecom
p
utin
g
(
HPC
);
accelerator
;
hetero
g
eneousarchitecture
;
self
desi
g
ned
;
hi
g
hefficienc
y
 
 
(
hi
g
h
p
erformancecom
p
utin
g
,
HPC
)
,
,
E
E
,
AMD
,
E
,
CPUGPDSP
,
,
E
 
;
;
;
;
 TP
;
TN
   
(
hi
g
h
p
erformancecom
p
utin
g
,
HPC
)
,
E
E
AMD
INSTINCT
TM
MI
TFLOPS
,
TFLOPS
,
0W
[
]
A
GPU
[
]
,
TFLOPS
,
(
tensorcore
)
TFLOPS
,
0W
,
Ponte Vecchio
X
e
HPC
GPU
,
TFLOPS
[
]
,
E
CPU+GPDSP
,
,
TFLOPS
,
GFLOPSW
,
E
Fi
g
 Architectureofthechi
p
 
 
,
CPU
GPDSP
_
Cluster
CPU
FTC2CPU
(
ARM
),
GPDSP
_
Cluster
FTMDSP
,
CPU
访
,
GPDSP
_
Cluster
,
GPDSP
_
Cluster
b
,
,
,
GPDSP
_
Cluster
DSP
(
DSP
DSP
)
GPDSP
_
Cluster
,
HBM
,
QoS
CrossBar
(
CrossNet
)
,
 GPDSP
DSP
,
(
ver
y
lon
g
instruction
word
,
VLIW
)
VPE
,
VPE
(
LPFetchDis
p
atch
)
(
scalarunit
)
(
vector
unit
)
,
VPE
SIMD
:
DSP
DMA
,
(
广
Su
p
erGather
)
Fi
g
 StructureofDSPCore
 DSP
 
,
CPU
Cache
(
6MB
LCache
),
GPDSP
_
Cluster
AI
,
,
GPDSP
(
MB
,
TBs
)
(
MB
,
2TBs
)
HBM
(
HBM
,
TBs
,
GB
)
,
DMA
,
 
:
E
of 4
免费下载
【版权声明】本文为墨天轮用户原创内容,转载时必须标注文档的来源(墨天轮),文档链接,文档作者等基本信息,否则作者和墨天轮有权追究责任。如果您发现墨天轮中有涉嫌抄袭或者侵权的内容,欢迎发送邮件至:contact@modb.pro进行举报,并提供相关证据,一经查实,墨天轮将立刻删除相关内容。

评论

关注
最新上传
暂无内容,敬请期待...
下载排行榜
Top250 周榜 月榜