Optimal Document Archiving and Fast Information Retrieval

This paper presents an intelligent algorithm for optimal document archiving. Electronic archives are known to be very important for information system management, and minimizing the size of the data stored in an electronic archive is a main concern because it reduces the physical storage required. The effect of different types of Arabic fonts on electronic archive size is discussed. Simulation results show that PDF is the best file format for storing Arabic documents in an electronic archive. Furthermore, fast information detection in a given PDF file is introduced. This approach uses fast neural networks (FNNs) implemented in the frequency domain: the networks operate by performing cross correlation in the frequency domain rather than in the spatial domain. It is proved mathematically and confirmed in practice that the presented FNNs require fewer computation steps than conventional neural networks (CNNs). Simulation results using MATLAB confirm the theoretical computations.
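
The idea of cross correlation in the frequency domain can be illustrated with a minimal one-dimensional sketch. It is not the FNN implementation described in this paper. It only shows, under simplifying assumptions, that sliding a weight vector over an input sequence gives the same result whether computed directly or through Fourier transforms. The function names (fft, cross_correlate_direct, cross_correlate_fft) and the sample data are hypothetical and chosen for illustration. The sketch is written in Python rather than MATLAB and uses a plain radix-2 FFT.

```python
import cmath


def fft(x, inverse=False):
    """Recursive radix-2 FFT; len(x) must be a power of two.

    With inverse=True the result is NOT scaled by 1/N; the caller does that.
    """
    n = len(x)
    if n == 1:
        return list(x)
    sign = 1 if inverse else -1
    even = fft(x[0::2], inverse)
    odd = fft(x[1::2], inverse)
    out = [0j] * n
    for k in range(n // 2):
        twiddle = cmath.exp(sign * 2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + twiddle
        out[k + n // 2] = even[k] - twiddle
    return out


def cross_correlate_direct(signal, weights):
    """Slide the weights over the signal: about n*m multiplications."""
    n, m = len(signal), len(weights)
    return [
        sum(signal[i + j] * weights[j] for j in range(m))
        for i in range(n - m + 1)
    ]


def cross_correlate_fft(signal, weights):
    """Same result via the correlation theorem: IFFT(FFT(s) * conj(FFT(w)))."""
    n, m = len(signal), len(weights)
    size = 1
    while size < n:  # pad to a power of two that holds the whole signal
        size *= 2
    s = fft([complex(v) for v in signal] + [0j] * (size - n))
    w = fft([complex(v) for v in weights] + [0j] * (size - m))
    product = [a * b.conjugate() for a, b in zip(s, w)]
    corr = fft(product, inverse=True)
    # Only lags 0..n-m are free of wrap-around; scale by 1/size here.
    return [corr[i].real / size for i in range(n - m + 1)]


if __name__ == "__main__":
    signal = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
    weights = [1.0, 0.0, -1.0]
    print(cross_correlate_direct(signal, weights))
    print(cross_correlate_fft(signal, weights))
```

For an input of length n and a weight vector of length m, the direct form needs on the order of n*m multiplications, while the transform-based form needs on the order of N log N operations for the padded length N. The frequency-domain route therefore tends to pay off as the weight vector grows.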



