A System for Analyzing and Eliciting Public Grievances Using Cache Enabled Big Data

The system for analyzing and eliciting public grievances serves its main purpose to receive and process all sorts of complaints from the public and respond to users. Due to the more number of complaint data becomes big data which is difficult to store and process. The proposed system uses HDFS to store the big data and uses MapReduce to process the big data. The concept of cache was applied in the system to provide immediate response and timely action using big data analytics. Cache enabled big data increases the response time of the system. The unstructured data provided by the users are efficiently handled through map reduce algorithm. The processing of complaints takes place in the order of the hierarchy of the authority. The drawbacks of the traditional database system used in the existing system are set forth by our system by using Cache enabled Hadoop Distributed File System. MapReduce framework codes have the possible to leak the sensitive data through computation process. We propose a system that add noise to the output of the reduce phase to avoid signaling the presence of sensitive data. If the complaints are not processed in the ample time, then automatically it is forwarded to the higher authority. Hence it ensures assurance in processing. A copy of the filed complaint is sent as a digitally signed PDF document to the user mail id which serves as a proof. The system report serves to be an essential data while making important decisions based on legislation.




References:
[1] Arthur G. Erdman, Daniel F. Keefe, Senior Member, IEEE, and Randall
Schiestl, “Grand Challenge Applying Regulatory Science and Big Data
to Improve Medical Device Innovation,” IEEE Transaction on
Biomedical Engineering, vol. 60(3), pp. 700-706, March 2013.
[2] Benedikt Elser and Alberto Montresor, “An Evaluation Study of
BigData Frameworks for Graph Processing,” IEEE International
Conference on Big Data, pp.60-67, 2013. [3] C.Dobre, F.Xhafa, “Intelligent services for Big Data science,” Future
Generation Computer Systems, pp. 1-15, July 2013.
[4] C.L. Philip Chen and Chun-Yang Zhang, “Data-intensive applications,
challenges, techniques and technologies A survey on Big Data,”
Information Sciences, pp. 1-34, January 2014.
[5] Chad A. Steed, Danial M. Ricciuto, and Galen Shipman, “Big data
visual analytics for exploratory earth system simulation analysis,”
Computers & Geosciences, pp. 71-82, August 2013.
[6] Chia-Wei Lee, kuang-Yu Hsieh, Sun-Yuan Hsieh, and Hung-Chang
Hsiao, “A Dynamic Data Placement Strategy for Hadoop in
Heterogeneous Environments,” Big Data Research, pp. 14-22, July
2014.
[7] Daniel E. O’Leary, “Artificial Intelligence and Big Data,” IEEE
Intelligent Systems, pp. 96-99, March/April 2013.
[8] Foto N. Afrati and Jeffrey D. Ullman, “Optimizing Multiway joins in
Map reduce Environment,” IEEE Transaction on Knowledge and Data
Engineering, vol. 23(9), pp. 1282-1298, September 2011.
[9] Hadoop, http://hadoop.apache.org/, 2014.
[10] Juwei Shi, Wei Xue, Wenjie Wang, and yuzhou Zhang, “Scalable
community detection in massive social networks using MapReduce,”
IBM Research and Development, vol. 57(3/4), pp. 1-14, May/July 2013.
[11] Quang Tran and Hiroyuki Sato, “A Solution for Privacy Protection In
MapReduce,” IEEE 36th International Conference on Computer Software
and Applications, pp. 515-520, 2012.
[12] Yaxiong Zhao, Jie Wu, and Cong Liu, “Dache: A Data Aware Caching
for Big-Data Applications Using the MapReduce Framework,”
Tsinghuascience and technology, vol. 19(1), pp. 39-50, February 2014.