International Journal of Progressive Sciences and Technologies (IJPSAT)
ISSN: 2509-0119.
© 2022 Scholar AI LLC.
Vol. 34 No. 2 September 2022, pp. 205-234
Corresponding Author: Dr. Meshack Owira Amimo 205
The Decision Tree Aided Neuro-Fuzzy Inference
Characterization of the Stochastic Hydrology of the Tana
Alluvial Aquifer
Dr. Meshack Owira Amimo
1
and Dr K.S.S. Rakesh
2
1
Research Scholar, Livingstone International University of Tourism & Business Management (LIUTEBM),
Zambia,
bmoamimo@gmail.com
2
CEO, Gradxs,
India
kssrakesh@gmail.com
Abstract—The Tana Alluvial Aquifer is the name given to the little-understood aquifer which is active in the areas bordering the River
Tana Flow course as the river weaves its way through the sedimentary plains of Balambala, Garissa, Fafi and Ijara and, finally, into the
Tana Delta areas, with the common denominator being the proximity to the Lower Tana catchment, especially the riparian corridor of
the River itself, and beyond. The aquifer may extend to between five to fifteen kilometers away from the river channels course way, and
at times, it may be felt even 20 kilometers away. The geology of the locality is heterogeneous and comprise sediments whose soil
mechanics may not be easily deciphered, since some areas close to the river have very fresh water while others are saline (Bura East in
Fafi Sub County easily comes to mind here). There are areas far from the river but bearing fresh water (Mulanjo comes to mind). In
some areas, sites close to the river discharge low yield figures, whereas those located farther afield discharge favorably. The water
quality and discharge are therefore stochastic variables, subject to chance occurrence. In view of this inconsistency, and on the account
of data scarcity, the neuro-fuzzy inference algorithm was developed to map the Universe of Discourse of the Tana Alluvial Aquifer, aka
the T.A.A., as it relates to the longitudes, latitudes, depths, and discharges of the aquifers in the study area. The mapping was with
respect to aquifer discharge, the variable used to characterize an aquifer, in terms of Transmissivity and Hydraulic Conductivity,
thereby defining aquifer recharge propensity. Membership functions were developed using the trapezoidal membership family, and
fuzzy rules were appropriately evolved from the fuzzified aquifer data, before finally employing the Sugeno inference engines (in
Python) to make predictions of discharge, at each of the T.A.A. aquifer subsets mapped for fresh, saline, hard and blackish water
species. The accuracy in the outputs achieved in the areas mapped vindicated the power of the neuro-fuzzy inference systems, as the
accuracy oscillated between 92 and 99 percent, when the discharge values predicted were compared with the actual known discharge
values of the wells mapped. The water quality class characterization was then undertaken using the decision tree (DT) algorithm in
python which gave rise to a 100 percent prediction accuracy. The same DT algorithm could not successfully predict the discrete values
of aquifer discharge or EC values, with as much accuracy (but performed excellently with salinity class data), and that was why fuzzy
logic was employed. The study vindicated the use of the DT and Fuzzy Logic Algorithms as simple, yet powerful analytical tools, in
characterizing the Stochastic Hydrology of the Tana Alluvial Aquifer.
Key Words: Stochastic Hydrology, Fuzzy Logic, Decision Tree, Tana Alluvial Aquifer, Gini Index, Information Gain, Membership
Function, Universe of Discourse, Defuzzification