Publications

2018

  • [ICPADS] Optimizing Deep Learning Frameworks Incrementally to Get Linear Speedup: A Comparison Between IPoIB and RDMA Verbs. Chang Liu, Jianwen Wei, Yi-Chao Wang, Minhua Wen, Simon See and James Lin. IEEE ICPADS, Sentosa, Singapore, December 11-13, 2018. (CCF Class C)

  • [HPC China] Performance Evaluation of ARM Cortex-A72 Multi-core Server Processors. Yi-Chao Wang, Xinxin Chen, Yujie Yang, Sicheng Zuo and James Lin. HPC China, Qingdao, China, October 18-20, 2018.

  • Optimizing a Particle-in-Cell Code on Intel Knights Landing. Minhua Wen, Min Chen and James Lin. In IXPUG Workshop Asia 2018, ACM, Tokyo, Japan, 2018.

  • [CLUSTER] OpenACC vs the Native Programming on Sunway TaihuLight: A Case Study with GTC-P. Linjin Cai, Yi-Chao Wang, William Tang, Bei Wang, Stephane Ethier, Zhao Liu and James Lin. IEEE Cluster Conference, Belfast, UK, September 10-14, 2018. (CCF Class C) [pdf]

  • [PARCO] Evaluating the SW26010 many-core processor with a micro-benchmark suite for performance optimizations[J]. James Lin, Zhigeng Xu, Linjin Cai, Akira Nukada and Satoshi Matsuoka. Parallel Computing, 2018, 77: 128-143. (CCF Class B)

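  • Porting and Optimizing GTC-P on TaihuLight with OpenACC[J] (in Chinese). Yichao Wang, James Lin, Linjin Cai, William Tang, Stephane Ethier, Bei Wang, Simon See and Satoshi Matsuoka. Journal of Computer Research and Development, 2018, 55(4): 875-884. (EI indexed) [pdf]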

  • Optimizations of Preconditioned Conjugate Gradient on TaihuLight for OpenFOAM. James Lin, Minhua Wen, Delong Meng, Xin Liu, Akira Nukada and Satoshi Matsuoka. CCGrid 2018. (CCF Class C)

  • Optimization and Evaluation of VLPL-S on Knights Landing[J]. Ding D, Wen M, Zhou S, et al. Journal of Frontiers of Computer Science & Technology, 2018. [pdf]

2017

  • [ICPP] Optimizations of Two Compute-bound Scientific Kernels on SW26010 Many-core Processor. James Lin, Zhigeng Xu, Akira Nukada, Naoya Maruyama and Satoshi Matsuoka. International Conference on Parallel Processing (ICPP 2017), Bristol, UK, August 14-17, 2017. (CCF Class B) [pdf]

  • [IPDPSW] Benchmarking SW26010 Many-Core Processor. Zhigeng Xu, James Lin and Satoshi Matsuoka. 2017 IEEE International Parallel and Distributed Processing Symposium: Workshops (IPDPSW), IEEE, 2017: 432-441. [pdf]

2016

  • [Concurrency and Computation: Practice and Experience] Accelerating Asian option pricing on many-core architectures[J]. Shuo Li and James Lin. Concurrency and Computation: Practice and Experience, 2016, 28(3): 848-865. (CCF Class C) [pdf]

  • [NAOC] An Empirical Model to Form and Evolve Galaxies in Dark Matter Halos. Shi-Jie Li, You-Cai Zhang, Xiao-Hu Yang, Hui-Yuan Wang, Dylan Tweed, Cheng-Ze Liu, Lei Yang, Feng Shi, Yi Lu, Wen-Tao Luo and Jian-Wen Wei. 2016 National Astronomical Observatories, Chinese Academy of Sciences and IOP Publishing Ltd. Research in Astronomy and Astrophysics, Volume 16, Number 8. (IF=1.292) [pdf]

  • [PDCAT] Performance and Portability Studies with OpenACC Accelerated Version of GTC-P. Yueming Wei, Yichao Wang, Linjin Cai, William Tang, Bei Wang, Stephane Ethier, Simon See and James Lin. The 17th International Conference on Parallel and Distributed Computing, Applications and Technologies, Guangzhou, China, December 16-18, 2016. [pdf]

  • [HPC China] Porting and Optimizing GTC-P on TaihuLight Supercomputer with Sunway OpenACC. Yichao Wang, James Lin, Linjin Cai, William Tang, Stephane Ethier, Bei Wang, Simon See and Satoshi Matsuoka. HPC China 2016, (Best Paper Award), Xi’an, China, October 27-29, 2016. [pdf] [slides]

  • [HPC China] Parallelization and Optimization of Laser-Plasma-Interaction Simulation Based on Kepler Cluster. Haipeng Wu, Minhua Wen, Simon See and James Lin. HPC China 2016, Xi’an, China, October 27-29, 2016. [pdf] [slides]

  • [HPC China] Hybrid Implementation and Optimization of OpenFOAM on the SW26010 Many-core Processor. Delong Meng, Minhua Wen, Jianwen Wei and James Lin. HPC China 2016, Xi’an, China, October 27-29, 2016.       

  • [HPC China] Optimizing a Galaxy Group Finding Algorithm on SMP vs. Distributed Memory Cluster. Yumeng Si, Jianwen Wei, Simon See and James Lin. HPC China 2016, Xi’an, China, October 27-29, 2016. [pdf] [slides]

2015

  • [IPDPSW] Understanding Performance Portability of OpenACC for Supercomputers. Suttinee Sawadsitang, James Lin, Simon See, Francois Bodin and Satoshi Matsuoka. Parallel and Distributed Processing Symposium Workshop (IPDPSW), 2015 IEEE International. IEEE, 2015: 699-707. [pdf]

  • [CCGrid Workshop] An Evaluation of Unified Memory Technology on NVIDIA GPUs. Wenqiang Li, Guanghao Jin, Xuewen Cui and Simon See. 2nd Workshop on Parallel Programming Model for the Masses (PPMM2015). [pdf] [slides]

  • [CCGrid DS] Modeling Gather and Scatter with Hardware Performance Counters for Xeon Phi. James Lin, Akira Nukada and Satoshi Matsuoka. Doctoral Symposium, CCGrid, 2015. [pdf]

  • [HPC China] Accelerating Gene Clustering on Heterogeneous Clusters. Jianwen Wei, Zhigeng Xu, Bingqiang Wang, Simon See and James Lin. HPC China 2015, Wuxi, China, November 6-8, 2015. [pdf] [slides]

  • [HPC China] Evaluating Intel AVX2 Vgather Instructions with Stencils. James Lin, Qiang Qin, Shuo Li, Minhua Wen and Satoshi Matsuoka. HPC China 2015, Wuxi, China, November 6-8, 2015. [pdf]

  • [HPC China] Optimize Irregular Memory Access in Astronomic Clustering Application. He Hao, Yumeng Si, Jianwen Wei, Minhua Wen and James Lin. HPC China 2015, Wuxi, China, November 6-8, 2015. [pdf]

  • [HPC China] Implementation and Optimization of HGGF Application on Intel Xeon Phi Platform. Zhigeng Xu, Junyi Qiu, Yueming Wei, Yizhe Dong, Jianwen Wei and James Lin. HPC China 2015, Wuxi, China, November 6-8, 2015. [pdf]

2014

  • [HPC China] Node-level Memory Access Optimization on Intel Knights Corner. James Lin, Shuo Li, Jiaming Zhao and Satoshi Matsuoka. HPC China 2014, Guangzhou, China, November 4-8, 2014. [pdf]

  • [HPC China] Research and Advices on Development Patterns of Supercomputing Centers in the World. Gui'an Feng and James Lin. HPC China 2014, Guangzhou, China, November 4-8, 2014. [pdf]

2013

  • [HPC China] A NVIDIA Kepler Based Acceleration of PIC Method. Minhua Wen, James Lin and Simon See. HPC China 2013, Guilin, China, October 27-31, 2013. [pdf]

  • [HPC China] Performance Portability Evaluation for OpenACC on Intel Knights Corner and Nvidia Kepler. Yichao Wang, Qiang Qin, Simon See and James Lin. HPC China 2013, Guilin, China, October 27-31, 2013. [pdf]

2012

  • [HPC China] A GPU Based Parallel Method For Dynamic Collision Grid DSMC. Minhua Wen, James Lin and Simon See. HPC China 2012, Hunan, China, October 27-31, 2012. [word]

     

