Alibaba neural network defeats human in global reading test
@Alibaba says its deep neural network model has outscored humans in a global reading test, paving the way for the underlying technology to reduce the need for human input.
The Chinese tech giant’s research unit, Institute of Data Science of Technologies (IDST), said it had developed a deep-learning model that attained a score of 82.44 in Exact Match on the Stanford Question Answering Dataset(Squad). Humans had clocked a previous score of 82.304, it said.
Squad comprised more than 100,000 question-and-answer sets based on more than 500 Wikipedia articles, in which participants were required to build machine-learning models to respond to the questions. These models would be evaluated by Squad, which then would run the model on the test set.
Various universities, research institutions, and technology vendors were participants including Tencent, Google, IBM, Microsoft, Samsung, Tel-Aviv University, and South Korea’s Kangwon National University. A handful had participated multiple times in the past year including Microsoft Research Asia, which previous score of 82.136 was clocked on December 17, 2017, while Alibaba’s previous score of 79.199 was recorded on December 28, 2017.
In its statement Monday, the Chinese vendor said it was the first to surpass humans in the test, but Squad listed the Chinese vendor as shared leader alongside Microsoft Research Asia, which scored a higher 82.65. Squad highlighted Microsoft’s rank as “January 3, 2018”, while Alibaba’s was “January 5, 2018”.
A spokesperson for Alibaba explained that the dates indicated when the respective model was submitted. He told ZDNet that the actual test results officially registered by Squad for Alibaba was January 11, 2018–a day ahead of Microsoft’s–which gave the Chinese vendor the distinction of being “first” to surpass human scores.