Fraunhofer SCAI completes pilot project on information extraction from Chinese Scientific Literature

The pilot project was initiated as a feasibility study to evaluate how far current text mining technology is able to support automated information extraction from Chinese text sources such as scientific publications and the patent literature.

In the course of this project, ProMiner, the named entity recognition software developed at Fraunhofer SCAI, has been adapted to the specific requirements of text mining in Chinese scientific biomedical and pharmaceutical literature. Most commercial text mining technology is able to analyse English text, and some solutions provide functionalities for the analysis of German or French text. However, due to the steep increase in Chinese scientific output and the ever growing importance and attractiveness of the Chinese market to Western companies, the ability to automatically analyse Chinese unstructured information sources is of utmost importance for scientific and competitive intelligence aiming to closely follow what happens in China.

Evaluation of the performance of the pilot system jointly demonstrates that Chinese literature can be mined for biomedical terms with similar performance as English literature. However, “the challenge of Chinese Text Mining cannot be regarded as being solved”, Dr. Juliane Fluck, Head of the Text Mining Team at Fraunhofer SCAI makes clear: “we have just demonstrated that we are able to mine the Chinese biomedical scientific literature automatically. The real work – which is aiming at providing all functionalities needed for true knowledge discovery from Chinese unstructured text sources – starts now, after the proof-of-principle”. Prof. Martin Hofmann-Apitius, Head of the Department of Bioinformatics at Fraunhofer SCAI sheds some light onto another, rather “academic” aspect of this work: “we were in the favourable situation that we have Chinese students doing their Master degree in Life Science Informatics at Bonn-Aachen International Center for Information Technology (B-IT).

The next steps in this collaboration will see an extension to another Fraunhofer Institute: the Fraunhofer Institute for Systems and Innovation Research (ISI). ISI in Karlsruhe has strong ties to China and is specialized on monitoring Chinese research, innovation and markets. Through collaboration with the Chinese Institute of Policy and Management, an institute of the Chinese Academy of Sciences (CAS), ISI is a premier partner when it comes to understanding science and innovation in China.

About Fraunhofer:
Fraunhofer is Europe’s largest application-oriented research organization. Research of practical utility lies at the heart of all activities pursued by the Fraunhofer-Gesellschaft. Founded in 1949, the research organization undertakes applied research that drives economic development and serves the wider benefit of society. At present, the Fraunhofer-Gesellschaft maintains more than 80 research units in Germany, including 60 Fraunhofer Institutes. The majority of the more than 18,000 staff are qualified scientists and engineers, who work with an annual research budget of EUR 1.65 billion.

The Fraunhofer Institute for Algorithms and Scientific Computing SCAI conducts research in the field of computer simulations for product and process development. SCAI designs and optimizes industrial applications, implements custom solutions for production and logistics, and offers HPC and Cloud solutions. Services are based on industrial engineering and methods from applied mathematics and information technology.

Contact:

Prof. Dr. Martin Hofmann-Apitius
Head of the Department of Bioinformatics
Fraunhofer Institute for Algorithms and Scientific Computing SCAI
53754 Sankt Augustin, Germany
phone: +49 2241 14-2802
martin.hofmann-apitus@scai.fraunhofer.de

Media Contact

Michael Krapp Fraunhofer-Institut

All latest news from the category: Information Technology

Here you can find a summary of innovations in the fields of information and data processing and up-to-date developments on IT equipment and hardware.

This area covers topics such as IT services, IT architectures, IT management and telecommunications.

Back to home

Comments (0)

Write a comment

Newest articles

NASA: Mystery of life’s handedness deepens

The mystery of why life uses molecules with specific orientations has deepened with a NASA-funded discovery that RNA — a key molecule thought to have potentially held the instructions for…

What are the effects of historic lithium mining on water quality?

Study reveals low levels of common contaminants but high levels of other elements in waters associated with an abandoned lithium mine. Lithium ore and mining waste from a historic lithium…

Quantum-inspired design boosts efficiency of heat-to-electricity conversion

Rice engineers take unconventional route to improving thermophotovoltaic systems. Researchers at Rice University have found a new way to improve a key element of thermophotovoltaic (TPV) systems, which convert heat…