IEICE globals.ieice.org Site

Author Search Result

[Author] Zhu LI(12hit)

1-12hit

A Novel Test Data Compression Scheme for SoCs Based on Block Merging and Compatibility
Tiebin WU Hengzhu LIU Botao ZHANG

PAPER

Vol:
E97-A No:7
Page(s):
1452-1460
This paper presents a novel test data compression scheme for SoCs based on block merging and compatibility. The technique exploits the properties of compatibility and inverse compatibility between consecutive blocks, consecutive merged blocks, and two halves of the encoding merged block itself to encode the pre-computed test data. The decompression circuit is simple to be implemented and has advantage of test-independent. In addition, the proposed scheme is applicable for IP cores in SoCs since it compresses the test data without requiring any structural information of the circuit under test. Experimental results demonstrate that the proposed technique can achieve an average compression ratio up to 68.02% with significant low test application time.
Overall Resource Efficiency Measure of Digital Modulation Methods
Jinzhu LIU Lianfeng SHEN

LETTER-Fundamental Theories for Communications

Vol:
E92-B No:9
Page(s):
2948-2950
A coordinate plane representation of the resource requirements of digital modulation methods is presented, and an overall resource efficiency measure is proposed. This measure can be used for the comparison of digital modulation methods and the evaluation of an emerging modulation technique. Several typical digital modulation methods are compared based on this measure to show its validity.
Template Matching Method Based on Visual Feature Constraint and Structure Constraint
Zhu LI Kojiro TOMOTSUNE Yoichi TOMIOKA Hitoshi KITAZAWA

PAPER-Image Recognition, Computer Vision

Vol:
E95-D No:8
Page(s):
2105-2115
Template matching for image sequences captured with a moving camera is very important for several applications such as Robot Vision, SLAM, ITS, and video surveillance systems. However, it is difficult to realize accurate template matching using only visual feature information such as HSV histograms, edge histograms, HOG histograms, and SIFT features, because it is affected by several phenomena such as illumination change, viewpoint change, size change, and noise. In order to realize robust tracking, structure information such as the relative position of each part of the object should be considered. In this paper, we propose a method that considers both visual feature information and structure information. Experiments show that the proposed method realizes robust tracking and determine the relationships between object parts in the scenes and those in the template.
Low Cost CORDIC-Based Configurable FFT/IFFT Processor for OFDM Systems
Dongpei LIU Hengzhu LIU Botao ZHANG Jianfeng ZHANG Shixian WANG Zhengfa LIANG

PAPER-OFDM

Vol:
E95-A No:10
Page(s):
1683-1691
High-performance FFT processor is indispensable for real-time OFDM communication systems. This paper presents a CORDIC based design of variable-length FFT processor which can perform various FFT lengths of 64/128/256/512/1024/2048/4096/8192-point. The proposed FFT processor employs memory based architecture in which mixed radix 4/2 algorithm, pipelined CORDIC, and conflict-free parallel memory access scheme are exploited. Besides, the CORDIC rotation angles are generated internally based on the transform of butterfly counter, which eliminates the need of ROM making it memory-efficient. The proposed architecture has a lower hardware complexity because it is ROM-free and with no dedicated complex multiplier. We implemented the proposed FFT processor and verified it on FPGA development platform. Additionally, the processor is also synthesized in 0.18 µm technology, the core area of the processor is 3.47 mm2 and the maximum operating frequency can be up to 500 MHz. The proposed FFT processor is better trade off performance and hardware overhead, and it can meet the speed requirement of most modern OFDM system, such as IEEE 802.11n, WiMax, 3GPP-LTE and DVB-T/H.
Radiometric Identification Based on Parameters Estimation of Transmitter Imperfections
You Zhu LI Yong Qiang JIA Hong Shu LIAO

LETTER-Communication Theory and Signals

Vol:
E103-A No:2
Page(s):
563-566
Radio signals show small characteristic differences between radio transmitters resulted from their idiosyncratic hardware properties. Based on the parameters estimation of transmitter imperfections, a novel radiometric identification method is presented in this letter. The fingerprint features of the radio are extracted from the mismatches of the modulator and the nonlinearity of the power amplifier, and used to train a support vector machine classifier to identify the class label of a new data. Experiments on real data sets demonstrate the validation of this method.
Exclusive Block Matching for Moving Object Extraction and Tracking
Zhu LI Kenichi YABUTA Hitoshi KITAZAWA

PAPER-Image Recognition, Computer Vision

Vol:
E93-D No:5
Page(s):
1263-1271
Robust object tracking is required by many vision applications, and it will be useful for the motion analysis of moving object if we can not only track the object, but also make clear the corresponding relation of each part between consecutive frames. For this purpose, we propose a new method for moving object extraction and tracking based on the exclusive block matching. We build a cost matrix consisting of the similarities between the current frame's and the previous frame's blocks and obtain the corresponding relation by solving one-to-one matching as linear assignment problem. In addition, we can track the trajectory of occluded blocks by dealing with multi-frames simultaneously.
Robust Subspace Analysis and Its Application in Microphone Array for Speech Enhancement
Zhu Liang YU Meng Hwa ER

PAPER-Microphone Array

Vol:
E88-A No:7
Page(s):
1708-1715
A robust microphone array for speech enhancement and noise suppression is studied in this paper. To overcome target signal cancellation problem of conventional beamformer caused by array imperfections or reverberation effects of acoustic enclosure, the proposed microphone array adopts an arbitrary model of channel transfer function (TF) relating microphone and speech source. Since the estimation of channel TF itself is often intractable, herein, transfer function ratio (TFR) is estimated instead and used to form a suboptimal beamformer. A robust TFR estimation method is proposed based on signal subspace analysis technique against stationary or slowly varying noise. Experiments using simulated signal and actual signal recorded in a real room illustrate that the proposed method has high performance in adverse environment.
Efficient Hardware Accelerator for Compressed Sparse Deep Neural Network
Hao XIAO Kaikai ZHAO Guangzhu LIU

LETTER-Computer System

Pubricized:
2021/02/19
Vol:
E104-D No:5
Page(s):
772-775
This work presents a DNN accelerator architecture specifically designed for performing efficient inference on compressed and sparse DNN models. Leveraging the data sparsity, a runtime processing scheme is proposed to deal with the encoded weights and activations directly in the compressed domain without decompressing. Furthermore, a new data flow is proposed to facilitate the reusage of input activations across the fully-connected (FC) layers. The proposed design is implemented and verified using the Xilinx Virtex-7 FPGA. Experimental results show it achieves 1.99×, 1.95× faster and 20.38×, 3.04× more energy efficient than CPU and mGPU platforms, respectively, running AlexNet.
CoDMA: Buffer Avoided Data Exchange in Distributed Memory Systems
Ting CHEN Hengzhu LIU Botao ZHANG

PAPER-Integrated Electronics

Vol:
E97-C No:4
Page(s):
386-391
Data exchange, in which two blocks of data are swapped between cores in distributed memory systems, necessitates additional memory buffer in a multiprocessor system-on-chip. In this paper, we propose a novel bidirectional inter-core communication mechanism called coherent direct memory access (CoDMA). The CoDMA ensures that the writing address is always less than the reading address in coherent read and write mode, so as to avoid read-after-write (RAW) errors. It features an efficient data exchanging scheme without using data buffer in the memory. A four-core single-instruction multiple-data processor is established for the experiments, based on a multi-bus network-on-chip. Experimental results show that the proposed method consumes no additional memory buffer and achieves 39% and 20% average performance improvement compared with traditional Methods 1 and 2, respectively. And a maximal of 43% reduction in memory usage is achieved, at the cost of only 0.22% more area overhead compared with the entire system.
Extraction and Tracking Moving Objects in Detail Considering Visual Feature Constraint and Structure Constraint
Zhu LI Yoichi TOMIOKA Hitoshi KITAZAWA

PAPER-Image Recognition, Computer Vision

Vol:
E96-D No:5
Page(s):
1171-1181
Detailed tracking is required for many vision applications. A visual feature-based constraint underlies most conventional motion estimation methods. For example, optical flow methods assume that the brightness of each pixel is constant in two consecutive frames. However, it is difficult to realize accurate extraction and tracking using only visual feature information, because viewpoint changes and inconsistent illumination cause the visual features of some regions of objects to appear different in consecutive frames. A structure-based constraint of objects is also necessary for tracking. In the proposed method, both visual feature matching and structure matching are formulated as a linear assignment problem and then integrated.
A 0.13 mJ/Prediction CIFAR-100 Fully Synthesizable Raster-Scan-Based Wired-Logic Processor in 16-nm FPGA Open Access
Dongzhu LI Zhijie ZHAN Rei SUMIKAWA Mototsugu HAMADA Atsutake KOSUGE Tadahiro KURODA

PAPER

Pubricized:
2023/11/24
Vol:
E107-C No:6
Page(s):
155-162
A 0.13mJ/prediction with 68.6% accuracy wired-logic deep neural network (DNN) processor is developed in a single 16-nm field-programmable gate array (FPGA) chip. Compared with conventional von-Neumann architecture DNN processors, the energy efficiency is greatly improved by eliminating DRAM/BRAM access. A technical challenge for conventional wired-logic processors is the large amount of hardware resources required for implementing large-scale neural networks. To implement a large-scale convolutional neural network (CNN) into a single FPGA chip, two technologies are introduced: (1) a sparse neural network known as a non-linear neural network (NNN), and (2) a newly developed raster-scan wired-logic architecture. Furthermore, a novel high-level synthesis (HLS) technique for wired-logic processor is proposed. The proposed HLS technique enables the automatic generation of two key components: (1) Verilog-hardware description language (HDL) code for a raster-scan-based wired-logic processor and (2) test bench code for conducting equivalence checking. The automated process significantly mitigates the time and effort required for implementation and debugging. Compared with the state-of-the-art FPGA-based processor, 238 times better energy efficiency is achieved with only a slight decrease in accuracy on the CIFAR-100 task. In addition, 7 times better energy efficiency is achieved compared with the state-of-the-art network-optimized application-specific integrated circuit (ASIC).
Having an Insight into Malware Phylogeny: Building Persistent Phylogeny Tree of Families
Jing LIU Pei Dai XIE Meng Zhu LIU Yong Jun WANG

LETTER-Information Network

Pubricized:
2018/01/09
Vol:
E101-D No:4
Page(s):
1199-1202
Malware phylogeny refers to inferring evolutionary relationships between instances of families. It has gained a lot of attention over the past several years, due to its efficiency in accelerating reverse engineering of new variants within families. Previous researches mainly focused on tree-based models. However, those approaches merely demonstrate lineage of families using dendrograms or directed trees with rough evolution information. In this paper, we propose a novel malware phylogeny construction method taking advantage of persistent phylogeny tree model, whose nodes correspond to input instances and edges represent the gain or lost of functional characters. It can not only depict directed ancestor-descendant relationships between malware instances, but also show concrete function inheritance and variation between ancestor and descendant, which is significant in variants defense. We evaluate our algorithm on three malware families and one benign family whose ground truth are known, and compare with competing algorithms. Experiments demonstrate that our method achieves a higher mean accuracy of 61.4%.

Author Search Result

[Author] Zhu LI(12hit)

A Novel Test Data Compression Scheme for SoCs Based on Block Merging and Compatibility

Overall Resource Efficiency Measure of Digital Modulation Methods

Template Matching Method Based on Visual Feature Constraint and Structure Constraint

Low Cost CORDIC-Based Configurable FFT/IFFT Processor for OFDM Systems

Radiometric Identification Based on Parameters Estimation of Transmitter Imperfections

Exclusive Block Matching for Moving Object Extraction and Tracking

Robust Subspace Analysis and Its Application in Microphone Array for Speech Enhancement

Efficient Hardware Accelerator for Compressed Sparse Deep Neural Network

CoDMA: Buffer Avoided Data Exchange in Distributed Memory Systems

Extraction and Tracking Moving Objects in Detail Considering Visual Feature Constraint and Structure Constraint

A 0.13 mJ/Prediction CIFAR-100 Fully Synthesizable Raster-Scan-Based Wired-Logic Processor in 16-nm FPGA Open Access

Having an Insight into Malware Phylogeny: Building Persistent Phylogeny Tree of Families

Latest Issue

FlyerIEICE has prepared a flyer regarding multilingual services. Please use the one in your native language.

Links

Call for Papers

Submit to IEICE Trans.

Transactions NEWS

Popular articles