Prediction of Algae Growth: A Machine Learning Perspective

Sanjeeb Tiwary
3 min readAug 11, 2023

--

Algae, a diverse group of aquatic eukaryotic organisms capable of photosynthesis, play a vital role in the biosphere as primary producers of organic matter and oxygen. However, the uncontrolled and excessive growth of algae, known as Harmful Algal Blooms (HABs), has become a global environmental concern. In a recent paper titled “Prediction of Algae Growth: A Machine Learning Perspective,” researchers delve into the potential of machine learning (ML) to predict and control the growth of harmful algal blooms.

The Algae Challenge

Algae blooms, under favourable environmental conditions, can experience exponential growth, forming vast territories known as HABs. These blooms can lead to oxygen depletion in water bodies and produce harmful toxins that pose a threat to aquatic ecosystems, human health, and industries reliant on water resources. The complex and dynamic nature of algal growth makes it challenging to predict and manage.

The Machine Learning Approach

Machine Learning (ML), a subset of Artificial Intelligence (AI), has emerged as a powerful tool to address complex and data-intensive problems. In the context of predicting algae growth, ML offers the potential to analyze and model the intricate relationships between environmental variables and algal proliferation.

The researchers propose a machine-learning approach that combines various algorithms to predict algal blooms. They utilize techniques such as Artificial Neural Networks (ANN), Gradient Boosting Decision Trees (GBDT), and Support Vector Machines (SVM) to forecast the growth of harmful algae based on selected environmental variables. These algorithms learn from historical data, enabling them to identify patterns, trends, and relationships that contribute to the development of algal blooms.

Data Sources and Experimentation

The study involves experimenting with two distinct datasets, one from Lake Okeechobee in Florida and the other from Sassafras River in the United States. These datasets include water quality parameters and information about environmental conditions associated with algal blooms.

To validate the effectiveness of the ML models, the researchers measure predictive accuracy using metrics like Mean Squared Error (MSE) and Coefficient of Determination (R2). These metrics provide insights into the accuracy of the models in predicting algal growth.

Results and Insights

After feature selection, the ML models exhibit significant improvements in predictive accuracy. For instance, the MSE of GBDT is reduced from 0.745 to 0.056, and the R2 is increased from 0.477 to 0.962 for one dataset. This indicates that the model is better at predicting algal blooms after identifying and incorporating the most relevant features.

the ANN is 0.873 which is 87.3% GBDT is 0.868 which is 86.6% and the SVM is 0.839 which is 83.9 which is shown in Table. UAR (Unforeseen Attack Robustness) is the total number of predictions which is correct to the data available and is quite simple to understand and compute. Sensitivity shows how much the model is sensitive to the data and distances between true and false data types which proportion true positives, whereas the specificity is the true negative and the accuracy is the total num
Predictive accuracy for ANN, GBDT and SVM

Conclusion and Future Directions

The study’s findings highlight the potential of machine learning in predicting algal blooms, a crucial step toward mitigating their harmful effects. The combination of ANN, GBDT, and SVM algorithms, along with feature selection, showcases the promise of ML in addressing complex environmental challenges.

However, the researchers also acknowledge the limitations and uncertainties in understanding the full dynamics of algal blooms. Further research and exploration are necessary to refine the models, incorporate additional variables, and enhance the accuracy of predictions.

In conclusion, the integration of machine learning techniques into the study of algal growth offers a novel perspective on managing the environmental and economic impacts of harmful algal blooms. As technology continues to advance, the potential for accurate predictions and effective interventions in tackling this ecological challenge becomes more attainable.

Note: This blog is based on the research article titled “Prediction of Algae Growth: A Machine Learning Perspective” authored by Sanjeeb Tiwary, Subhashree Darshana, Debabrata Mohanty, Adyasha Dash, P Rupsa, and Rabindra K. Barik.

--

--