The method for detecting the negative terms in Chinese electronic medical record (EMR) is useful in providing evidence for constructing concept index. In this respect, we adopted an improved method which combined maximum matching with mutual information in order to extract terms in EMRs. This method can overcome the influence of overlay ambiguity. In addition, for the determination of negative semantic, we also adopted an improved method which combined rule-based method with word co-occurrence. This new method can reduce the probability of appearance of false positive terms caused by punctuation input errors. The result showed that the negative predictive value is 7.85% higher than the rule-based method.
Aiming at feature selection problem of motor imagery task in brain computer interface (BCI), an algorithm based on mutual information and principal component analysis (PCA) for electroencephalogram (EEG) feature selection is presented. This algorithm introduces the category information, and uses the sum of mutual information matrices between features under different motor imagery category to replace the covariance matrix. The eigenvectors of the sum matrix represent the direction of the principal components and the eigenvalues of the sum matrix are used to determine the dimensionality of principal components. 2005 International BCI competition data set was used in our experiments, and four feature extraction methods were adopted, i. e. power spectrum estimation, continuous wavelet transform, wavelet packet decomposition and Hjorth parameters. The proposed feature selection algorithm was adopted to select and combine the most useful features for classification. The results showed that relative to the PCA algorithm, our algorithm had better performance in dimensionality reduction and in classification accuracy with the assistance of support vector machine classifier under the same dimensionality of principal components.
Reconstruction of gene regulatory networks (GRNs) from large-scale expression data can mine the potential causality relationship among the genes and help understand the complex regulatory mechanisms. It is of utmost interest and has become a challenging computational problem for understanding the complex regulatory mechanisms in cellular systems. For the past decades, numerous theoretical and computational approaches have been introduced for inferring the GRNs. However, all existing methods of inferring GRNs from gene expression profiles have their strengths and weaknesses. In particular, many properties of GRNs, such as topology sparseness and non-linear dependence, are generally in regulation mechanism but are seldom taken into account simultaneously in one computational method. Some information theory algorithms do not recover the true positive edges that may have been deleted in an earlier computing process. These interaction relationships may reflect the actual relationship of genes. To overcome these disadvantages and to further enhance the precision and robustness of inferred GRNs, we presented an ensemble method, to infer GRNs from gene expression data by adopting two strategies of resampling and arithmetic mean fusion in this work. In this algorithm, the jackknife resampling procedure was first employed to form a series of sub-datasets of gene expression data, then the conditional mutual information was used to generate the corresponding sub-networks from the sub-datasets, and the final GRN was inferred by integrating these sub-networks with an arithmetic mean fusion strategy. Compared with those of the state-of-the-art algorithm on the benchmark synthetic GRNs datasets from the DREAM3 challenge and a real SOS DNA repair network, the results show that our method outperforms significantly LP, LASSO and ARANCE methods, and has a high and robust performance.
The motor nervous system transmits motion control information through nervous oscillations, which causes the synchronous oscillatory activity of the corresponding muscle to reflect the motion response information and give the cerebral cortex feedback, so that it can sense the state of the limbs. This synchronous oscillatory activity can reflect connectivity information of electroencephalography-electromyography (EEG-EMG) functional coupling. The strength of the coupling is determined by various factors including the strength of muscle contraction, attention, motion intention etc. It is very significant to study motor functional evaluation and control methods to analyze the changes of EEG-EMG synchronous coupling caused by different factors. This article mainly introduces and compares coherence and Granger causality of linear methods, the mutual information and transfer entropy of nonlinear methods in EEG-EMG synchronous coupling, and summarizes the application of each method, so that researchers in related fields can understand the current research progress on analysis methods of EEG-EMG synchronous systematically.
In order to more accurately and effectively understand the intermuscular coupling of different temporal and spatial levels from the perspective of complex networks, a new multi-scale intermuscular coupling network analysis method was proposed in this paper. The multivariate variational modal decomposition (MVMD) and Copula mutual information (Copula MI) were combined to construct an intermuscular coupling network model based on MVMD-Copula MI, and the characteristics of intermuscular coupling of multiple muscles of upper limbs in different time-frequency scales during reaching exercise in healthy subjects were analyzed by using the network parameters such as node strength and clustering coefficient. The experimental results showed that there are obvious differences in the characteristics of intermuscular coupling in the six time-frequency scales. Specifically, the triceps brachii (TB) had relatively high coupling strength with the middle deltoid (MD) and posterior deltoid (PD), and the intermuscular function was closely connected. However, the biceps brachii (BB) was independent of other muscles. The intermuscular coupling network had scale differences. MVMD-Copula MI can quantitatively describe the relationship of multi-scale intermuscular coupling strength, which has good application prospects.
At present, the incidence of Parkinson’s disease (PD) is gradually increasing. This seriously affects the quality of life of patients, and the burden of diagnosis and treatment is increasing. However, the disease is difficult to intervene in early stage as early monitoring means are limited. Aiming to find an effective biomarker of PD, this work extracted correlation between each pair of electroencephalogram (EEG) channels for each frequency band using weighted symbolic mutual information and k-means clustering. The results showed that State1 of Beta frequency band (P = 0.034) and State5 of Gamma frequency band (P = 0.010) could be used to differentiate health controls and off-medication Parkinson’s disease patients. These findings indicated that there were significant differences in the resting channel-wise correlation states between PD patients and healthy subjects. However, no significant differences were found between PD-on and PD-off patients, and between PD-on patients and healthy controls. This may provide a clinical diagnosis reference for Parkinson’s disease.