Granularity Analysis for Spatio-Temporal Web Sensors

In recent years, many researches to mine the exploding Web world, especially User Generated Content (UGC) such as weblogs, for knowledge about various phenomena and events in the physical world have been done actively, and also Web services with the Web-mined knowledge have begun to be developed for the public. However, there are few detailed investigations on how accurately Web-mined data reflect physical-world data. It must be problematic to idolatrously utilize the Web-mined data in public Web services without ensuring their accuracy sufficiently. Therefore, this paper introduces the simplest Web Sensor and spatiotemporallynormalized Web Sensor to extract spatiotemporal data about a target phenomenon from weblogs searched by keyword(s) representing the target phenomenon, and tries to validate the potential and reliability of the Web-sensed spatiotemporal data by four kinds of granularity analyses of coefficient correlation with temperature, rainfall, snowfall, and earthquake statistics per day by region of Japan Meteorological Agency as physical-world data: spatial granularity (region-s population density), temporal granularity (time period, e.g., per day vs. per week), representation granularity (e.g., “rain" vs. “heavy rain"), and media granularity (weblogs vs. microblogs such as Tweets).

Authors:



References:
[1] K. Dave, S. Lawrence, and D. M. Pennock, "Mining the Peanut Gallery:
Opinion Extraction and Semantic Classification of Product Reviews,"
in Proc. 12th International World Wide Web Conference (WWW-03),
Hungary, pp. 519-528, 2003.
[2] S. Fujimura, M. Toyoda, and M. Kitsuregawa, "A Reputation Extraction
Method Considering Structure of Sentence," in Proc. 16th IEICE Data
Engineering Workshop (DEWS-05), Japan, 6C-i8, 2005.
[3] T. Tezuka, T. Kurashima, and K. Tanaka, "Toward Tighter Integration of
Web Search with a Geographic Information System," in Proc. 15th Int-l
World Wide Web Conference (WWW-06), Scotland, pp. 277-286, 2006.
[4] K. Inui, S. Abe, H. Morita, M. Eguchi, A. Sumida, C. Sao, K. Hara,
K. Murakami, and S. Matsuyoshi, "Experience Mining: Building a
Large-Scale Database of Personal Experiences and Opinions from Web
Documents," in Proc. 7th IEEE/WIC/ACM International Conference on
Web Intelligence (WI-08), Australia, pp. 314-321, 2008.
[5] M. A. Hearst, "Automatic Acquisition of Hyponyms from Large Text
Corpora," in Proc. 14th International Conference on Computational
Linguistics (COLING-92), France, vol. 2, pp. 539-545, 1992.
[6] M. Ruiz-Casado, E. Alfonseca, and P. Castells, "Automatising the Learning
of Lexical Patterns: An Application to the Enrichment of WordNet by
Extracting Semantic Relationships from Wikipedia," Data & Knowledge
Engineering, vol. 61, no. 3, pp. 484-499, June 2007.
[7] S. Hattori, H. Ohshima, S. Oyama, and K. Tanaka, "Mining the Web
for Hyponymy Relations based on Property Inheritance," in Proc. 10th
Asia-Pacific Web Conf. (APWeb-08), LNCS vol. 4976, pp. 99-110, 2008.
[8] S. Hattori and K. Tanaka, "Extracting Concept Hierarchy Knowledge from
the Web based on Property Inheritance and Aggregation," in Proc. 7th
IEEE/WIC/ACM International Conference on Web Intelligence (WI-08),
Australia, pp. 432-437, 2008.
[9] S. Hattori, "Object-oriented Semantic and Sensory Knowledge Extraction
from the Web," in Web Intelligence and Intelligent Agents, In-Tech, ch.
18, pp. 365-390, 2010.
[10] S. Hattori, "Hyponym Extraction from the Web based on Property
Inheritance of Text and Image Features," in Proc. 6th International
Conference on Advances in Semantic Processing (SEMAPRO-12), Spain,
pp. 109-114, 2012.
[11] T. Tezuka and K. Tanaka, "Visual Description Conversion for Enhancing
Search Engines and Navigational Systems," in Proc. 8th Asia-Pacific Web
Conference (APWeb-06), China, LNCS vol. 3841, pp. 955-960, 2006.
[12] S. Hattori, T. Tezuka, and K. Tanaka, "Mining the Web for Appearance
Description," in Proc. 18th International Conference on Database and
Expert Systems Applications (DEXA-07), Germany, LNCS vol. 4653, pp.
790-800, 2007.
[13] S. Hattori, "Peculiar Image Retrieval by Cross-Language Web-extracted
Appearance Descriptions," Int-l Journal of Computer Information Systems
and Industrial Management, MIR Labs, vol. 4, pp. 486-495, Dec. 2011.
[14] S. Hattori, "Hyponymy-Based Peculiar Image Retrieval," International
Journal of Computer Information Systems and Industrial Management
(IJCISIM), MIR Labs, vol. 5, pp. 79-88, June 2012.
[15] S. Hattori and K. Tanaka, "Mining the Web for Access Decision-Making
in Secure Spaces," in Proc. Joint 4th Int-l Conference on Soft Computing
and Intelligent Systems and 9th International Symposium on advanced
Intelligent Systems (SCIS&ISIS-08), Japan, TH-G3-4, pp. 370-375, 2008.
[16] S. Hattori, "Secure Spaces and Spatio-Temporal Weblog Sensors with
Temporal Shift and Propagation," in Proc. 1st IRAST International
Conference on Data Engineering and Internet Technology (DEIT-11),
Indonesia, LNEE vol. 157, pp. 343-349, 2011.
[17] S. Hattori, "Linearly-Combined Web Sensors for Spatio-Temporal Data
Extraction from the Web," in Proc. 6th Int-l Workshop on Spatial and
Spatiotemporal Data Mining (SSTDM-11), Canada, pp. 897-904, 2011.
[18] S. Hattori, "Spatio-Temporal Web Sensors by Social Network Analysis,"
in Proc. 3rd International Workshop on Business Applications of Social
Network Analysis (BASNA-12), Turkey, pp. 1020-1027, 2012.
[19] Japan Meteorological Agency, http://www.jma.go.jp/jma/indexe.html.
[20] S. Hattori and K. Tanaka, "Towards Building Secure Smart Spaces
for Information Security in the Physical World," Journal of Advanced
Computational Intelligence and Intelligent Informatics (JACIII), Fuji
Technology Press, vol. 11, no. 8, pp. 1023-1029, September 2007.
[21] S. Hattori and K. Tanaka, "Secure Spaces: Protecting Freedom of Information
Access in Public Places," in Proc. 5th International Conference
on Smart Homes and Health Telematics (ICOST-07), Japan, LNCS vol.
4541, pp. 99-109, 2007.
[22] S. Hattori, "Context-Aware Query Control for Secure Spaces," Journal
of Computer Technology and Application (JCTA), David Publishing, vol.
3, no. 2, pp. 130-139, February 2012.
[23] S. Hattori, "Ability-Based Expression Control for Secure Spaces," Proc.
Joint 6th International Conference on Soft Computing and Intelligent Systems
and 13th International Symposium on advanced Intelligent Systems
(SCIS&ISIS-12), Japan, F1-54-3, pp. 1298-1303, 2012.
[24] Google Web Search, http://www.google.co.jp/.