Add Mt_Gemm for the nonlocal_pw #6253

A-006 · 2025-05-31T02:31:17Z

What's changed?

Changed the LCAO basis type to single precision for the test in the file.
The nonlocal_pw implementation has now been added with DSP support for computation. However, there is an error in the mt_fft_device.dat when computing multiple matrices. As a temporary solution, we have added a Gemm (matrix multiplication) routine, and the results will be tested once the bug is fixed.

source/module_io/read_input_item_system.cpp

A-006 added 2 commits May 30, 2025 21:44

change globalv

05dd94b

add dsp for the nonlocal_pw

b902d17

mohanchen reviewed May 31, 2025

View reviewed changes

source/module_io/read_input_item_system.cpp Outdated Show resolved Hide resolved

A-006 and others added 3 commits June 2, 2025 12:37

modify basis name

4f92fb9

Merge branch 'develop' into fft15

03ee34a

Merge branch 'develop' into fft15

ae6dcfc

mohanchen approved these changes Jun 7, 2025

View reviewed changes

mohanchen merged commit 2b1e662 into deepmodeling:develop Jun 7, 2025
14 checks passed

mohanchen added GPU & DCU & HPC GPU and DCU and HPC related any issues Refactor Refactor ABACUS codes Features Needed The features are indeed needed, and developers should have sophisticated knowledge labels Jun 7, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Mt_Gemm for the nonlocal_pw #6253

Add Mt_Gemm for the nonlocal_pw #6253

Uh oh!

A-006 commented May 31, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Add Mt_Gemm for the nonlocal_pw #6253

Add Mt_Gemm for the nonlocal_pw #6253

Uh oh!

Conversation

A-006 commented May 31, 2025

What's changed?

Uh oh!

Uh oh!

Uh oh!

Uh oh!