Abstract: Data mining is the process of discovering interesting patterns in large amounts of data. Clustering is one of the most important supporting processes for accessing such data quickly. Clustering identifies similarities between data objects according to their characteristics and groups related objects into clusters. A cluster ensemble combines multiple runs of different clustering algorithms to obtain a consensus partition of the original dataset, consolidating the outcomes of a collection of individual clusterings. The performance of a cluster ensemble is mainly affected by two principal factors: diversity and quality. This paper presents an overview of different cluster ensemble algorithms and of the methods the related literature uses to improve diversity and quality, provides a comparative analysis of these cluster ensembles, and summarizes the various cluster ensemble methods. This analysis should be useful to clustering practitioners and should help in selecting the most appropriate method for the problem at hand.
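As a minimal illustration of how an ensemble can consolidate several partitions, the sketch below builds a co-association matrix (the fraction of base clusterings that place two objects in the same cluster) and then links objects whose co-association exceeds a threshold. This is one common consensus scheme, not necessarily one of the methods surveyed in the paper; the function names, the 0.5 threshold, and the toy labelings are assumptions made for illustration.

```python
from itertools import combinations


def co_association(partitions):
    """Fraction of partitions in which each pair of objects shares a cluster."""
    n = len(partitions[0])
    matrix = [[0.0] * n for _ in range(n)]
    for labels in partitions:
        for i in range(n):
            for j in range(n):
                if labels[i] == labels[j]:
                    matrix[i][j] += 1.0 / len(partitions)
    return matrix


def consensus_clusters(partitions, threshold=0.5):
    """Group objects linked by a co-association above the threshold (union-find)."""
    n = len(partitions[0])
    matrix = co_association(partitions)
    parent = list(range(n))

    def find(x):
        # Follow parent links to the representative, compressing the path.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for i, j in combinations(range(n), 2):
        if matrix[i][j] > threshold:
            parent[find(i)] = find(j)

    groups = {}
    for i in range(n):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


# Three hypothetical base clusterings of five objects (label values are arbitrary).
partitions = [
    [0, 0, 1, 1, 1],
    [1, 1, 0, 0, 2],
    [0, 0, 0, 1, 1],
]
print(consensus_clusters(partitions))
```

The base clusterings disagree about objects 2 and 4, but the consensus still recovers two groups because each disputed pair is joined through object 3, which co-occurs with both in two of the three runs.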
Abstract: This study provides a brief description of stress in the Najdi Arabic dialect as well as in Modern Standard Arabic. Beyond the analysis of stress patterns, the paper also addresses important phenomena that affect stress, namely epenthesis (insertion), vowel shortening, and consonant (glottal stop) deletion.
Abstract: Groundwater is a vital water resource in many areas of the world, particularly in the Middle East, where water resources are scarce and being depleted. Sustainable management and planning of groundwater resources have become essential and urgent given the impact of global climate change. In recent years, numerical models have been widely used to predict flow patterns and to assess water resource security, as well as groundwater quality affected by transported contaminants. In this study, MODFLOW is used to study the current status of groundwater resources and the risk to water resource security in the region centred on Al-Najaf City, which is located in the mid-west of Iraq, adjacent to the Euphrates River. A conceptual model is built using the geologic and hydrogeologic data collected for the region, together with Digital Elevation Model (DEM) data obtained from the "Global Land Cover Facility" (GLCF) and the "United States Geological Survey" (USGS) for the study area. The computer model also includes the distribution of 69 wells in the area, with a steady pre-defined hydraulic head along its boundaries. The model is then run with a recharge rate (from precipitation) of 7.55 mm/year, obtained from analysis of field data in the study area for the period 1980-2014. The hydraulic conductivity measured at the well locations is interpolated for model use. The model is calibrated against the measured hydraulic heads at 50 of the 69 wells in the domain, and the results show good agreement. The standard error of estimate (SEE), root-mean-square error (RMSE), normalized RMSE and correlation coefficient are 0.297 m, 2.087 m, 6.899% and 0.971, respectively. A sensitivity analysis is also carried out, and the model is found to be sensitive to recharge, particularly when the rate is greater than 15 mm/year.
Hydraulic conductivity is found to be another parameter that can affect the results significantly; it therefore requires high-quality field data. The results show a general flow pattern from the west to the east of the study area, which agrees well with the observations and the gradient of the ground surface. With the current operational pumping rates of the wells in the area, a dry area develops in Al-Najaf City due to the large quantity of groundwater withdrawn. The computed water balance with the current operational pumping shows that the Euphrates River supplies approximately 11759 m3/day to the groundwater, instead of gaining 11178 m3/day from the groundwater as it would with no pumping from the wells. The results obtained from the study are expected to provide important information for the sustainable and effective planning and management of the regional groundwater resources for Al-Najaf City.
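The calibration statistics quoted above can be sketched as follows. This is a minimal illustration, not the study's own procedure: the abstract does not say how the RMSE is normalized, so dividing by the range of observed heads is an assumption, and the head values below are hypothetical rather than taken from the 50 calibration wells.

```python
import math


def rmse(observed, simulated):
    """Root-mean-square error between observed and simulated heads."""
    squared = [(o - s) ** 2 for o, s in zip(observed, simulated)]
    return math.sqrt(sum(squared) / len(squared))


def normalized_rmse(observed, simulated):
    """RMSE as a percentage of the observed range (assumed normalization)."""
    return 100.0 * rmse(observed, simulated) / (max(observed) - min(observed))


def pearson_r(observed, simulated):
    """Pearson correlation coefficient between observed and simulated heads."""
    n = len(observed)
    mean_o = sum(observed) / n
    mean_s = sum(simulated) / n
    cov = sum((o - mean_o) * (s - mean_s) for o, s in zip(observed, simulated))
    var_o = sum((o - mean_o) ** 2 for o in observed)
    var_s = sum((s - mean_s) ** 2 for s in simulated)
    return cov / math.sqrt(var_o * var_s)


# Hypothetical hydraulic heads in metres at five wells.
observed_heads = [10.0, 14.0, 18.0, 22.0, 30.0]
simulated_heads = [11.0, 13.0, 19.0, 21.0, 31.0]

print(rmse(observed_heads, simulated_heads))
print(normalized_rmse(observed_heads, simulated_heads))
print(pearson_r(observed_heads, simulated_heads))
```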
Abstract: In this paper, we present a pedestrian detection descriptor called Fused Structure and Texture (FST) features, based on the combination of local phase information with texture features. Since the phase of a signal conveys more structural information than its magnitude, the phase congruency concept is used to capture the structural features. The Center-Symmetric Local Binary Pattern (CSLBP) approach, in turn, is used to capture the texture information of the image. Because phase congruency is a dimensionless quantity and the CSLBP operator is robust on flat images as well as to blur and illumination changes, the proposed descriptor is more robust and less sensitive to light variations. The descriptor is formed by extracting the phase congruency and the CSLBP values of each pixel of the image with respect to its neighborhood. The histogram of the oriented phase and the histogram of the CSLBP values for the local regions in the image are computed and concatenated to construct the FST descriptor. Several experiments were conducted on the INRIA and the low-resolution DaimlerChrysler datasets to evaluate the detection performance of a pedestrian detection system based on the FST descriptor. A linear Support Vector Machine (SVM) is used to train the pedestrian classifier. These experiments showed that the proposed FST descriptor has better detection performance than a set of state-of-the-art feature extraction methodologies.
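The texture half of the descriptor can be illustrated with a small sketch of the CSLBP operator on an 8-pixel neighborhood: each of the four centre-symmetric neighbor pairs contributes one bit, giving 16 possible codes per pixel. The function names, the radius of one pixel, and the default threshold of zero are assumptions for illustration; the phase congruency half and the paper's actual parameters are not shown.

```python
def cs_lbp(image, row, col, threshold=0.0):
    """CSLBP code (0..15) of an interior pixel from its 8-neighborhood."""
    # Neighbors listed clockwise from the top-left corner; neighbor k is
    # diametrically opposite neighbor k + 4.
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
               (1, 1), (1, 0), (1, -1), (0, -1)]
    code = 0
    for k in range(4):
        dr1, dc1 = offsets[k]
        dr2, dc2 = offsets[k + 4]
        diff = image[row + dr1][col + dc1] - image[row + dr2][col + dc2]
        if diff > threshold:
            code |= 1 << k  # set bit k when the pair differs by more than the threshold
    return code


def cs_lbp_histogram(image, threshold=0.0):
    """16-bin histogram of CSLBP codes over all interior pixels of a region."""
    hist = [0] * 16
    for r in range(1, len(image) - 1):
        for c in range(1, len(image[0]) - 1):
            hist[cs_lbp(image, r, c, threshold)] += 1
    return hist


# Hypothetical 3x3 grey-level patch with one bright corner.
patch = [
    [9, 1, 1],
    [1, 5, 1],
    [1, 1, 1],
]
print(cs_lbp(patch, 1, 1))
print(cs_lbp_histogram(patch))
```

Note that the centre pixel value never enters the comparison, and a perfectly flat patch always maps to code 0, which is consistent with the robustness on flat regions mentioned above.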
Abstract: The present construction industry in China faces several problems. Conflicts hinder the development of society as a whole, such as the contradictions between resource reserves and a huge population, between living space needs and low building production efficiency, and between environmental protection and a high-pollution production pattern. To address these problems, research is needed to explore a new building system. By investigating the whole architectural process and by comparative analysis of light and heavy structures, the paper raises concepts to cope with the existing challenges, such as design conception based on product and real construction processes, design methods focusing on components, and maximum utilization of temporary buildings by optimizing construction speed and building performance. The project was not only designed in virtual reality but also physically constructed. A series of aluminum light-structure housing systems was ultimately developed, characterized by high performance, extremely rapid construction, and flexible function. These systems can be used in many settings, ranging from a single building in a remote area to a large residential community.
Abstract: A Wireless Body Area Network (WBAN) is a short-range
wireless communication network around the human body for various
applications such as wearable devices, entertainment, military, and
especially medical devices. WBANs have attracted attention for
continuous health monitoring systems, including diagnostic procedures,
early detection of abnormal conditions, and prevention of emergency
situations. Compared to a cellular network, a WBAN system has more
difficulty controlling inter- and intra-cell interference due to
limited power, limited computational capability, patient mobility, and
non-cooperation among WBANs.
In this paper, we compare the performance of resource allocation
schemes based on several Pseudo Orthogonal Codewords (POCs) to
mitigate inter-WBAN interference. POCs have previously been widely
used as protocol sequences and optical orthogonal codes. Each
POC has different auto- and cross-correlation properties and
spectral efficiency depending on its construction. To identify
different WBANs, several different pseudo orthogonal patterns based
on POCs are exploited for the resource allocation of WBANs. By
simulating these pseudo orthogonal resource allocations of WBANs in
MATLAB, we obtain the performance of WBANs for
different POCs and analyze and evaluate the suitability of the POCs
for resource allocation in WBAN systems.
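The auto- and cross-correlation properties mentioned above can be made concrete with a short sketch. It computes the periodic correlation of two binary codewords, where a low cross-correlation means that two WBANs using different codewords rarely occupy the same slot. The two length-13 codewords are a textbook-style optical orthogonal code chosen here for illustration; they are an assumption and are not taken from the paper, whose simulations were run in MATLAB.

```python
def codeword(length, positions):
    """Binary codeword with ones at the given slot positions."""
    return [1 if i in positions else 0 for i in range(length)]


def periodic_correlation(a, b, shift):
    """Number of slots where a and the cyclically shifted b are both active."""
    n = len(a)
    return sum(a[i] * b[(i + shift) % n] for i in range(n))


# Hypothetical example: two weight-3 codewords of length 13.
c1 = codeword(13, {0, 1, 4})
c2 = codeword(13, {0, 2, 7})

# Worst-case overlap between the two codewords over all cyclic shifts.
max_cross = max(periodic_correlation(c1, c2, s) for s in range(13))
# Worst-case overlap of a codeword with a shifted copy of itself.
max_auto_sidelobe = max(periodic_correlation(c1, c1, s) for s in range(1, 13))

print(max_cross, max_auto_sidelobe, periodic_correlation(c1, c1, 0))
```

In this example, any two unsynchronized transmissions collide in at most one of the three active slots per period, which is the kind of property that makes such patterns candidates for uncoordinated resource allocation.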
Abstract: This study considers the linkage between management and computer science in order to develop software named “IntelSymb”, a demo application for analyzing data on the diversification of non-energy fields, which can help mitigate countries’ energy dependency. We analyzed 18 years of development across five economic sectors in 13 countries, identifying which patterns prevailed and which may become dominant in the near future. To make the analysis more solid, as future work we suggest developing a gateway or interface connected to all available online databases (WB, UN, OECD, U.S. EIA) for analyzing countries by field. The sample data consist of energy statistics (TPES and energy import indicators) and non-energy industry statistics (Main Science and Technology Indicators, Internet user index, and Sales and Production indicators) from 13 OECD countries over 18 years (1995-2012). Our results show that the diversification of non-energy industries can help decelerate energy sector dependency (energy consumption and import dependence on crude oil). These results can provide empirical and practical support for energy and non-energy industry diversification policies, such as promoting Information and Communication Technologies (ICTs), services, and the efficiency and management of innovative technologies, in other OECD and non-OECD member states with similar energy utilization patterns and policies. Industries, including the ICT sector, generate around 4 percent of total GHG emissions, but this is much higher, around 14 percent, if indirect energy use is included. The ICT sector itself (excluding the broadcasting sector) contributes approximately 2 percent of global GHG emissions, at just under 1 gigatonne of carbon dioxide equivalent (GtCO2eq).
Ergo, this can serve as a good example and lesson for energy-dependent and energy-independent countries alike, mainly emerging oil-based economies, and can motivate the diversification of non-energy industries so that countries are prepared for an energy crisis and able to face any economic crisis as well.
Abstract: Modelling tsunami propagation and inundation due to a far-field tsunami in a limited area is a very challenging numerical task because it involves many aspects, such as the formation of various types of waves and the irregularities of coastal boundaries. To compute the effect of a far-field tsunami and the extent of the resulting inland inundation along the coastal belts of the west coast of Malaysia and southern Thailand, a formulated boundary condition and a moving boundary condition are used simultaneously. In this study, a boundary-fitted curvilinear grid system is used in order to incorporate the coastal and island boundaries accurately, as the boundaries of the model domain are curvilinear in nature and strongly curved. The tsunami response to the 26 December 2004 event along the west open boundary of the model domain is computed to simulate the effect of the far-field tsunami. Based on the tsunami source data at the west open boundary of the model domain, a boundary condition is formulated and applied to simulate the tsunami response along the coastal and island boundaries. During the simulation, a moving boundary condition is used instead of a fixed vertical seaside wall. The extent of inland inundation and the tsunami propagation pattern are computed. Comparisons are carried out to validate the simultaneous use of the two boundary conditions. All simulations show excellent agreement with the observed data.
Abstract: Water suspensions of inorganic (metals and oxides)
and organic nano-objects (chitosan and collagen) were subjected to
treatment with direct and alternating electrical fields. In addition to
quasi-periodic spatial patterning, resonance-like behavior of the
spatial distributions of these suspensions was found at low
frequencies of the alternating electrical field. These resonances are
explained as the result of creation of equilibrium states of groups of
charged nano-objects with opposite signs of charges at the interparticle
distances where the forces of Coulomb attraction are
compensated by the repulsion forces induced by relatively negative
polarization of hydrated regions surrounding the nanoparticles with
respect to pure water. The low frequencies of these resonances are
explained by the comparatively large distances between the particles and
their large masses with respect to the masses of atoms constituting
molecules with high resonance frequencies. These new resonances
open a new approach to detailed modeling and understanding of
mechanisms of the influence of electrical fields on the functioning of
internal organs of living organisms at the level of cells and neurons.
Abstract: Copreneurship is a term used to describe the business
pattern of operations run by married couples who share commitment,
goals, and responsibilities in handling a business. Research conducted
overseas showed that copreneurship business activities grew quickly
and played a role in elevating families’ and nations’ socio-economic
standards. In Malaysia, copreneurship has long been cultivated by
spouses. Thus, this study aimed to explore the factors that motivate
married partners to start a copreneurship business, and who is the
dominant partner in the management of this business. The study
participants are four entrepreneurial couples who are SME business
operators selected through purposive sampling. In-depth interviews
and direct observation were used as methods of measurement for
triangulation of qualitative data in this study. The interview
data were analyzed using NVivo 8.0 software. The results
show that freedom is a key factor that drives entrepreneurs to set up
copreneurship businesses, and that the husband dominates the
management aspects of the business. The study gives an overview of
the parties involved in entrepreneurship to provide understanding of
the copreneurship concept as it is practiced. This study provides
academic value by creating understanding of the importance of a
harmonious family institution specifically for forming entrepreneurs
in the familial environment in Malaysia.
Abstract: The present paper highlights seasonal
variation in floristic composition and ecological strategies for the
management of ‘Gujar Tal’ at Jaunpur in the tropical semi-arid region of
eastern U.P. (India). A total of 47 macrophytes from 26 families were
recorded, with a maximum of 6 plant species from Cyperaceae,
from April 2012 to March 2013 at periodic intervals.
The maximum number of plants (39) was present during winter, followed
by the rainy (37) and summer (27) seasons. The distribution pattern
showed that the largest number of plants (27) belonged to marshy and
swampy habitats, usually transitional between land and water.
Abstract: Nowadays, food safety is a great public concern;
therefore, robust and effective techniques are required for assessing
the safety of food products. Hyperspectral Imaging (HSI) is an
attractive tool for researchers inspecting food quality and safety,
with applications such as meat quality assessment, automated poultry
carcass inspection, quality evaluation of fish, bruise detection of
apples, quality analysis and grading of citrus fruits, bruise detection
of strawberry, visualization of sugar distribution of melons,
measuring ripening of tomatoes, defect detection of pickling
cucumber, and classification of wheat kernels. HSI can be used to
concurrently collect large amounts of spatial and spectral data on the
objects being observed. This technique yields exceptional
detection capability, which cannot be achieved with either
imaging or spectroscopy alone. This paper presents a nonlinear
technique based on kernel Fukunaga-Koontz transform (KFKT) for
detection of fat content in ground meat using HSI. The KFKT which
is the nonlinear version of FKT is one of the most effective
techniques for solving problems involving two-pattern nature. The
conventional FKT method has been improved with kernel machines
for increasing the nonlinear discrimination ability and capturing
higher-order statistics of the data. The proposed approach in this paper
aims to segment the fat content of the ground meat by regarding the
fat as the target class, which is to be separated from the remaining
classes (as clutter). We have applied the KFKT to visible and
near-infrared (VNIR) hyperspectral images of ground meat to determine
fat percentage. The experimental studies indicate that the proposed
technique produces high detection performance for fat ratio in ground
meat.
Abstract: The very well-known stacked sets of numbers referred
to as Pascal’s triangle present the coefficients of the binomial
expansion of the form (x+y)^n. This paper presents an approach (the
Staircase Horizontal Vertical, SHV-method) to the generalization of
planar Pascal’s triangle for polynomial expansion of the form
(x+y+z+w+r+⋯)^n. The presented generalization of Pascal’s triangle
is different from other generalizations of Pascal’s triangles given in
the literature. The coefficients of the generalized Pascal’s triangles,
presented in this work, are generated by inspection, using embedded
Pascal’s triangles. The coefficients of I-variables expansion are
generated by horizontally laying out the Pascal’s elements of (I-1)
variables expansion, in a staircase manner, and multiplying them with
the relevant columns of vertically laid out classical Pascal’s elements,
hence avoiding factorial calculations for generating the coefficients
of the polynomial expansion. Furthermore, the classical Pascal’s
triangle has a pattern built into it regarding its odd and even
numbers. This pattern is known as Sierpinski’s triangle. In this
study, a presentation of Sierpinski-like patterns of the generalized
Pascal’s triangles is given. Applications of the coefficients
of the binomial expansion (Pascal’s triangle) or polynomial
expansion (generalized Pascal’s triangles) arise in areas such as
combinatorics and probability.
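The idea of obtaining the coefficients of an I-variable expansion from those of an (I-1)-variable expansion multiplied by classical Pascal entries can be sketched as below. This is not the SHV layout itself, whose staircase arrangement is not detailed in the abstract. It is a sketch of the standard identity that the coefficient of x1^k times a term of the remaining variables equals the Pascal entry C(n, k) times the corresponding coefficient of the smaller expansion of degree n-k. As in the paper's approach, no factorials are computed: Pascal's triangle is built by addition only. All function names are hypothetical.

```python
def pascal_rows(n):
    """Rows 0..n of Pascal's triangle, built by addition only."""
    rows = [[1]]
    for _ in range(n):
        prev = rows[-1]
        inner = [prev[i] + prev[i + 1] for i in range(len(prev) - 1)]
        rows.append([1] + inner + [1])
    return rows


def multinomial_coefficients(num_vars, n):
    """Map exponent tuples to the coefficients of (x1 + ... + x_num_vars)^n."""
    pascal = pascal_rows(n)

    def expand(vars_left, degree):
        if vars_left == 1:
            return {(degree,): 1}
        result = {}
        for k in range(degree + 1):
            # Exponent k for the first variable contributes C(degree, k);
            # the remaining variables share the remaining degree.
            for exps, coeff in expand(vars_left - 1, degree - k).items():
                result[(k,) + exps] = pascal[degree][k] * coeff
        return result

    return expand(num_vars, n)


trinomial = multinomial_coefficients(3, 3)
print(trinomial[(1, 1, 1)], trinomial[(2, 1, 0)], sum(trinomial.values()))
```

A quick sanity check on any such generalization is that the coefficients of an I-variable expansion of degree n sum to I^n, since setting every variable to 1 evaluates the polynomial.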
Abstract: Landfill leachates contain a number of persistent pollutants, including heavy metals. These can spread through ecosystems and accumulate in fish, most of which are classified as top consumers of trophic chains. Fish are free-swimming organisms, but, perhaps owing to their species-specific ecological and behavioral properties, they often prefer the most suitable biotopes and therefore do not avoid harmful substances or environments. That is why it is necessary to evaluate persistent pollutant dispersion in hydroecosystems using fish tissue metal concentrations. In hydroecosystems of hybrid type (e.g. river-pond-river), the distance from the pollution source could be a good indicator of this kind of metal distribution. The studies were carried out in the hybrid-type ecosystem neighboring the Kairiai landfill, which is located 5 km east of Šiauliai City. Fish tissue (gills, liver, and muscle) metal concentrations were measured in two ecologically different groups of fish distinguished by their feeding characteristics: benthophagous (Gibel carp, roach) and predatory (Northern pike, perch). A number of mathematical models (linear, non-linear, using log and other transformations) were applied in order to identify the most satisfactory description of the interdependence between fish tissue metal concentration and the distance from the pollution source. However, only the log-multiple regression model revealed a pattern: the distance from the pollution source is closely and positively correlated with metal concentration in all predatory fish tissues studied (gills, liver, and muscle).
Abstract: The cities of Johannesburg and Pretoria, both located in Gauteng province, are separated by a distance of 58 km. Traffic queues on the Ben Schoeman freeway, which connects the two cities, can stretch for almost 1.5 km. Vehicle traffic congestion negatively affects business and commuters’ quality of life. The goal of this paper is to identify variables that influence the flow of traffic and to design a vehicle traffic prediction model that predicts the traffic flow pattern in advance. The model will enable motorists to make appropriate travel decisions ahead of time. The data used were collected by Mikro’s Traffic Monitoring (MTM). A Multi-Layer Perceptron (MLP) was used on its own to construct the model, and the MLP was also combined with the Bagging ensemble method to train on the data. Cross-validation was used to evaluate the models. The results obtained from the two techniques were compared in terms of predictive performance and prediction cost. The cost was computed using a combination of the loss matrix and the confusion matrix. The resulting models show that the status of the traffic flow on the freeway can be predicted using the following parameters: travel time, average speed, traffic volume and day of month. The implication of this work is that commuters will be able to spend less time travelling on the route and more time with their families. The logistics industry will save more than twice what it is currently spending.
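One plausible way to combine a loss matrix with a confusion matrix is sketched below. Each cell count of the confusion matrix is weighted by the loss assigned to that (actual, predicted) pair, and the total is averaged over all predictions. The abstract does not give the exact formula, so the averaging step, the function name, and all numbers here are assumptions for illustration only.

```python
def prediction_cost(confusion, loss):
    """Average misclassification cost per prediction.

    confusion[i][j]: number of cases of actual class i predicted as class j.
    loss[i][j]: penalty for predicting class j when the actual class is i.
    """
    total = sum(sum(row) for row in confusion)
    weighted = sum(
        confusion[i][j] * loss[i][j]
        for i in range(len(confusion))
        for j in range(len(confusion[i]))
    )
    return weighted / total


# Hypothetical two-class example: rows are actual (free-flowing, congested),
# columns are predicted. Missing a congested period is penalized four times
# as heavily as a false congestion alarm.
confusion = [[40, 10],
             [5, 45]]
loss = [[0, 1],
        [4, 0]]

print(prediction_cost(confusion, loss))
```

Under such a scheme, two models with the same accuracy can have different costs, which is why a cost-based comparison can rank a single MLP and a bagged MLP differently from an accuracy-based one.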
Abstract: Energy consumption data, in particular those involving
public buildings, are impacted by many factors: the building structure,
climate/environmental parameters, construction, system operating
condition, and user behavior patterns. Traditional methods for data
analysis are insufficient. This paper delves into the data mining
technology to determine its application in the analysis of building
energy consumption data including energy consumption prediction,
fault diagnosis, and optimal operation. Recent literature is reviewed
and summarized, the problems faced by data mining technology in the
area of energy consumption data analysis are enumerated, and research
points for future studies are given.
Abstract: Landfill waste is a common problem, as a landfill has
economic and environmental impacts even after it is closed. Landfill
waste contains high concentrations of various persistent compounds such
as heavy metals and organic and inorganic materials. As persistent
compounds are slowly-degradable or even non-degradable in the
environment, they often produce sublethal or even lethal effects on
aquatic organisms. The aims of the present study were to estimate
sublethal effects of the Kairiai landfill (WGS: 55°55‘46.74“,
23°23‘28.4“) leachate on the locomotor activity of rainbow trout
Oncorhynchus mykiss juveniles using the original system package
developed in our laboratory for automated monitoring, recording and
analysis of aquatic organisms’ activity, and to determine patterns of
fish behavioral response to sublethal effects of leachate. Four
different concentrations of leachate were chosen: 0.125, 0.25, 0.5 and
1.0 mL/L (0.0025, 0.005, 0.01 and 0.02 of the 96-hour LC50,
respectively). Locomotor activity was measured after 5, 10 and 30
minutes of exposure during 1-minute test-periods of each fish (7 fish
per treatment). The threshold-effect-concentration amounted to 0.18
mL/L (0.0036 of the 96-hour LC50). This concentration was found
to be 2.8-fold lower than the concentration generally assumed to
be “safe” for fish. At higher concentrations, the landfill leachate
solution elicited a behavioral response of test fish to sublethal levels of
pollutants. The ability of the rainbow trout to detect and avoid
contaminants was evident after 5 minutes of exposure. The intensity of
locomotor activity reached a peak within 10 minutes, evidently
decreasing after 30 minutes. This could be explained by the
physiological and biochemical adaptation of fish to altered
environmental conditions. It has been established that the locomotor
activity of juvenile trout depends on leachate concentration and
exposure duration. Modeling of these parameters showed that the
activity of juveniles increased at higher leachate concentrations, but
slightly decreased with the increasing exposure duration. Experiment
results confirm that the behavior of rainbow trout juveniles is a
sensitive and rapid biomarker that can be used in combination with
the system for fish behavior monitoring, registration and analysis to
determine sublethal concentrations of pollutants in ambient water.
Further research should be focused on software improvement aimed
to include more parameters of aquatic organisms’ behavior and to
investigate the most rapid and appropriate behavioral responses in
different species. In practice, this study could be the basis for the
development and creation of biological early-warning systems
(BEWS).
Abstract: The myoelectric control system is the fundamental
component of modern prostheses; it uses the myoelectric signals
from an individual’s muscles to control the prosthesis movements.
The surface electromyogram (sEMG) signal, being noninvasive, has
been used as an input to prosthesis controllers for many years.
Recent technological advances have led to the development of
implantable myoelectric sensors which enable the internal
myoelectric signal (MES) to be used as input to these prostheses
controllers. The intramuscular measurement can provide focal
recordings from deep muscles of the forearm and independent signals
relatively free of crosstalk thus allowing for more independent
control sites. However, little work has been done to compare the two
inputs. In this paper we have compared the classification accuracy of
six pattern recognition based myoelectric controllers which use
surface myoelectric signals recorded using untargeted (symmetric)
surface electrode arrays to the same controllers with multichannel
intramuscular myoelectric signals from targeted intramuscular
electrodes as inputs. There was no significant enhancement in the
classification accuracy as a result of using the intramuscular EMG
measurement technique when compared to the results acquired using
the surface EMG measurement technique. Impressive classification
accuracy (99%) could be achieved by optimally selecting only five
channels of surface EMG.
Abstract: The beginning of the 21st century has witnessed new
advancements in the design and use of new materials for biosensing
applications, from nano to macro, protein to tissue. Traditional
analytical methods lack a complete toolset to describe the
complexities introduced by living systems, pathological relations,
discrete hierarchical materials, cross-phase interactions, and
structure-property dependencies. Materiomics – via systematic
molecular dynamics (MD) simulation – can provide
structure-process-property relations by using a materials science approach
that links mechanisms across scales and enables oriented biosensor
design. With this approach, DNA biosensors can be utilized to detect
disease biomarkers present in individuals’ breath such as acetone for
diabetes. Our wireless sensor array based on single-stranded DNA
(ssDNA)-decorated single-walled carbon nanotubes (SWNT) has
successfully detected trace amounts of various chemicals in vapor,
differentiated by pattern recognition. Here, we present how MD
simulation can revolutionize the design and screening of DNA
aptamers for targeting biomarkers related to oral diseases and oral
health monitoring. It demonstrates great potential for
building a library of DNA sequences for reliable detection of several
biomarkers of one specific disease, and also provides a new
methodology for creating, designing, and applying biosensors.
Abstract: In language learning, second language learners as well
as native speakers commit errors in their attempt to achieve
competence in the target language. The realm of collocation has to do
with meaning relations between lexical items. In all human languages,
there is a kind of ‘natural order’ in which words are arranged or relate
to one another in sentences so much so that when a word occurs in a
given context, the related or naturally co-occurring word will
automatically come to the mind. It becomes an error, therefore, if
students inappropriately pair or arrange such ‘naturally’ co–occurring
lexical items in a text. It has been observed that most of the second
language learners in this research group commit collocation errors. A
study of this kind is very significant as it gives insight into the kinds
of errors committed by learners. This will help the language teacher
to be able to identify the sources and causes of such errors as well as
correct them thereby guiding, helping and leading the learners
towards achieving some level of competence in the language. The
aim of the study is to understand the nature of these errors as
stumbling blocks to effective essay writing. The objective of the
study is to identify the errors, analyze their structural compositions so
as to determine whether there are similarities between students in this
regard and to find out whether there are patterns to these kinds of
errors which will enable the researcher to understand their sources
and causes. In this descriptive study, the researcher sampled
nine hundred essays collected from three hundred undergraduate
learners of English as a second language in the Federal College of
Education, Kano, North-West Nigeria, i.e. three essays per
student. The essays, which were assigned at three different lecture times,
were of similar thematic preoccupations (i.e. same topics) and length
(i.e. same number of words). The essays were written during the
lecture hour on three different lecture occasions. The errors were
identified in a systematic manner whereby errors so identified were
recorded only once even if they occurred several times in students’ essays.
The data were collated using percentages, with the identified
numbers of occurrences converted accordingly into percentages.
The findings from the study indicate that there are similarities as well
as regular and repeated errors which provided a pattern. Based on the
pattern identified, the conclusion is that students’ collocation errors
are attributable to poor teaching and learning which resulted in wrong
generalization of rules.