Capgemini-海洋数据和人工智能用于物种保护(英)-2022.10-14正式版.doc
《Capgemini-海洋数据和人工智能用于物种保护(英)-2022.10-14正式版.doc》由会员分享,可在线阅读,更多相关《Capgemini-海洋数据和人工智能用于物种保护(英)-2022.10-14正式版.doc(14页珍藏版)》请在淘文阁 - 分享文档赚钱的网站上搜索。
1、Ocean Dataand AI forspeciesconservation1Abstract.The problem and at the same time ourmotivation is the loss of species diversity in theocean, which is often also referred to as ”invisibledying”. With our approach, we pursue the goal ofmaking this dying visible and thus preventable.To make events in
2、the ocean visible, we needto identify patterns depending on the oceandepth and then recognize deviations from thesepatterns. With the Norwegian institute Lofoten-Vesteralen, we are analyzing ocean data, to helpdetect anomalies. This should enable a betterunderstanding of the ocean ecosystem. Thissho
3、uld help to identify the consequences ofhuman intervention in nature, such as dwindlingfish stocks.OCEAN DATA AND AI FOR SPECIES CONSERVATION I OCTOBER 20222IntroductionAccurate observation of ecosystems enable detailed oceanographic research, allowing anomalies to be identified in enormous amounts
4、of data with the help of artificial intelligence (AI).The Lofoten-Vesteralen (LoVe) Ocean Observatory is located west of Hovden Vesteralen in the northern part of Norway. It is located in an ecological, geological, oceanographic and economic ”hotspot”. A network of submarine cables and seven sensor
5、nodes covers a cross-section from the mainland to the deep sea. It includes a land-based station and seven sensor platforms, covering a gradient from sea level to a depth of 200m. The system continuously provides valuable online data on the marine environment in northern Norway, and has been active
6、since 2013.The system is both, a national research infrastructure, basic and applied research, as well as a test infrastructure, where industry partners can test new underwater sensors and technologies. The Lofoten-Vesteralen Ocean Observatory has collected over 100 terabytes of sensor data (tempera
7、tures, currents, echograms) over the years.The teamThomas Rammis a Software Engineer at Capgemini. He created the initial infrastructure and GitHub integration.Majed Alaitwniis a software developer. The focus of his bachelor thesis was interactive visualization for anomaly detection in ocean measure
8、ment data. He created the visualization.Geir Pedersenis a researcher at the LoVeOcean Ocean Observatory and supports the project on the Norwegian side.Tom Hattonis a Data Scientist, his masters thesis explored the use of unsupervised AI models for anomaly detection in high-dimensional ocean measurem
9、ent data. He continues to develop the AI model.Sophie Baderis a molecular biologist specializing in oceanic ecosystems, as well as a software developer. She assisted with the infrastructure.parMustapha Mustaphais a software developer at Capgemini. He has done planning work and helped design the orig
10、inal AI model.Daniel Friedmannis a software developer at Capgemini. He is an expert in containerization and Docker and provides content support for the project.Nils Olav Handegaardis a researcher at the LoVe Ocean Observatory. His research focuses on the application of new methodologies and data pro
11、cessing techniques to the fields of marine ecology and fisheries oceanography.Eldar Sultanow is Enterprise Architect Director at Capgemini. His main focus is on modern software architectures, digitalization and enterprise architecture management. He developed the code with Thomas Ramm in the initial
12、 phase and is now supervisor of the project and oversees research work.OCEAN DATA AND AI FOR SPECIES CONSERVATION I OCTOBER 20223OceAIn was created as a team name to participate in Capgeminis Global Data Science challenge (GDSC) 2021. The goal of OceAIn was to develop an AI model that gains new insi
13、ght into seasonal correlating patterns in ecosystems using the time series data which was collected by ocean sensors. This should help to build better models and understand the climate of our planet.The AI model processes data of the cross section from the mainland to the deep sea. They are collecte
14、d by four different sensors that measure (1) directional pulsating sounds in specific areas with a scientific echo sounder, (2) a so-called hydrophone, i.e. an underwater microphone that records sounds in the environment, (3) an Acoustic Doppler (ADCP) that detects thestep and cleaned and transforme
15、d in the next. These steps will take place in Docker containers, with each type of data (hydrophone, biomass detection, etc.) having its own container in every step. The collaboration of the containers is defined inApache Airflow, which works on the ”Configuration As Code” principle. Airflow allows
16、the definition of infrastructure using DAGs (Directed Acyclic Graph). It is also worth mentioning that Docker containers are actually managed by Kubernetes, which in turn is defined by Airflow. Finally, the transformed data is persisted in the form of CSV files. They are processed by the AI to detec
17、t anomalies, which are displayed with the data in the form of image files through an interactive web interface.Implementation of the AI model OceAIn includes an AI model that detects anomalies in ocean data. The sensor data for the AI model are highly variable in the type of information, as well as
18、in their duration. In addition, some types of anomalies are only detectable if the data points are considered interconnected from the beginning.The initial idea was to focus on individual models that could handle different types of data and later combine the separate models. Due to the variety of da
19、ta, this approach showed some disadvantages, such as increased complexity in aggregating the individual models.speed and direction of ocean currents using the Doppler effect, and finally, (4) point sensors that provide real-time physical, biological and chemical observations.The Identification of re
20、peating seasonal patterns and anomalies allows scientists to better monitor the marine environment. This involves widespread exploration of the anomalies and their influencing factors and drawing conclusions from the bigger picture, such as differences in fish populations, varying current patterns,
21、or the influence of climate change.While large volumes of raw data are difficult to process manually and the results are highly error-prone in the process, AI models allow filtering of this data for relevant events. In addition, AI enables continuous analyzation of incoming data, resulting in a stre
22、am of data to the researchers.Architektur des Systems Even though the AI model is the core of OceAIn, there are other components that make up the platform. Our next goal is to make them work together according to the cloud-native architecture concept, which will create a future-proof and flexible da
23、ta pipeline. The raw data provided by the institute will be collected in oneTable 1. Results of the Baseline ModelsMODELF1 SCOREPRECISIONRECALLMACROAVG F1always true1.000.300.470.23always false0.000.000.000.41uniform random0.540.400.460.58stratified0.170.210.190.44OCEAN DATA AND AI FOR SPECIES CONSE
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- Capgemini 海洋 数据 人工智能 用于 物种 保护 2022.10 14 正式版
限制150内