Merge branch 'master' into 'main'HEAD main

readme See merge request zyq/time_series_anomaly_detection!1
author: ZHENG Yanqin <[email protected]> 2023-05-25 07:37:53 +0000
committer: ZHENG Yanqin <[email protected]> 2023-05-25 07:37:53 +0000
commit: e9896bd62bb29da00ec00a121374167ad91bfe47 (patch)
tree: d94845574c8ef7473d0204d28b4efd4038035463 /README.md
parent: fad9aa875c84b38cbb5a6010e104922b1eea7291 (diff)
parent: 4c5734c624705449c6b21c4b2bc5554e7259fdba (diff)
1 files changed, 149 insertions, 90 deletions
diff --git a/README.md b/README.md
index 97694ef..8daa45f 100644
--- a/README.md
+++ b/README.md
@@ -1,92 +1,151 @@
-# Time Series Anomaly Detection
+时间序列异常检测实验工具脚本使用说明
+======
+说明
+------
+本工具用于进行时间序列异常检测模型实验，基于PyTorch框架实现的深度学习训练脚本。
+为了方便实验创新，本工具封装了dataloader、训练流程、测试流程、评估流程、日志记录与模型导出功能，
+并内置了部分数据集、预处理算法、模型与评估方法。
+在进行创新实验时，可以仅修改数据集、预处理算法、模型与评估方法四个内容中的一个就开始快速实验，节省实验的冗余代码开发。
+数据集、预处理算法、模型与评估方法四部分内容进行自定义时，请按照指定格式设计。
+实验的参数请通过config.ini文件进行设置。
+
+如需特殊接口，请联系开发人员郑延钦，邮箱:[email protected]
+
+安装
+------
+环境依赖
+
+python==3.9.7
+
+    * eventlet==0.33.3
+    * gevent==22.10.2
+    * loguru==0.6.0
+    * numpy==1.21.2
+    * pandas==1.4.1
+    * scikit_learn==1.2.2
+    * torch==1.10.2
+    * tqdm==4.63.1
+
+
+python第三方库安装方法
+
+    pip3 install -r requirements.txt
+    
+配置文件参数说明
+------
+config.ini文件用于配置相关设置，包括数据集、算法模型、预处理方法、评估方法、模型参数加载、实验基本参数与自定义参数。
+
+    [Dataset]
+    # 填写数据集名称（文件夹名称），同时训练测试多个数据集，请用逗号隔开
+    # 请确保数据集已经按照要求格式放置在./dataset路径下
+    name=SWAT
+    
+    [Method]
+    # 填写模型名称（文件名称，不用写".py"），同时训练测试多个算法，请用逗号隔开
+    # 请确保模型文件已经按照要求格式放置在./model路径下
+    # 模型名称请全部由字母组成
+    name=MtadGatAtt
+    
+    [Preprocess]
+    # 填写预处理方法名称（文件名称，不用写".py"）
+    # 请确保预处理方法文件已经按照要求格式放置在./preprocess路径下
+    name=standardization
+    
+    [Evaluation]
+    # 填写评估方法名称（文件名称，不用写".py"），同时使用多个评估方法，请用逗号隔开
+    # 请确保评估方法文件已经按照要求格式放置在./evaluation路径下
+    name=f1,f1pa,ftad,affiliation
+    
+    [ModelPath]
+    # 模型加载先前参数，请用"模型名_数据集名"作为变量名
+    # 路径名称从TSAD文件夹下开始写相对路径，或者直接写绝对路径
+    MtadGatAtt_SWAT=none
+    
+    [BaseParameters]
+    # train表示是否训练模型，请填写true或者false，为false时，仅测试模型
+    train=true
+    # epoch表示训练轮数，请填写正整数
+    epochs=20
+    # batch-size表示每个batch的尺寸，请填写正整数
+    batch_size=32
+    # learning-rate表示学习率，建议不要高于0.001
+    learning_rate=1e-4
+    # device表示使用的设备，可以选auto、cpu、cuda，其中auto代表自动判断，有gpu使用gpu，没有则使用cpu；选gpu时需要选择gpu序号，例如cuda:0
+    device=auto
+    
+    [CustomParameters]
+    # 此处可以填入自定义参数，数量不限，用于填写特殊需求输入，例如模型文件所需的所有外部参数，构建数据集的部分参数
+    # 输入数据长度input_size，即截取数据的窗口长度
+    input_size=60
+    # 输出数据长度output_size
+    output_size=1
+    # 截取数据的窗口的移动步长
+    step=1
+    # 数据集文件（csv）中，第一行是否需要忽略（第一行为列名的话需要忽略）
+    del_column_name=true
+    # 数据集文件（csv）中，第一列是否是时间戳
+    time_index=true
+    
+启动与停止
+------
+通过运行main.py脚本文件，启动程序。可以选择 start | stop | daemon 三个参数。
+
+* start是启动普通进程，前台运行，日志实时输出并本地记录，可输出训练进度条。
+* stop是停止当前实验进程。
+* daemon是启动守护进程，后台运行程序，日志仅保存本地，日志不包含训练进度条内容。
+
+如无输入参数，默认选择start参数。使用方法如下所示：
+
+    python3 main.py start/stop/daemon  
+    
+本工具会输出实验日志、所有模型的最佳检测标签、模型参数文件到./records路径下。
+    
+数据集文件配置说明
+-----
+所有数据集文件，均存放在./dataset路径下。
+数据集可以包含子数据集，数据以csv格式文件存储，必须包含train.csv和test.csv。
+
+* train.csv和test.csv的数据内容必须全为数值型数据，第一行可保留列名。
+* train.csv和test.csv如包含时间戳，请置于第一列。
+* test.csv最后一列为异常标签，建议用0表示正常数据，1表示异常数据（如标签含义相反，本工具会自动识别并反转标签）。
+
+数据集路径示例如下：
+
+    └── dataset
+        ├── 数据集1名称
+        │   ├── 子数据集1名称
+        │   │   ├── train.csv
+        │   │   ├── test.csv
+        │   ├── 子数据集2名称
+        ...
+        │   └── 子数据集m名称
+        ├── 数据集2名称
+        │   ├── train.csv
+        │   └── test.csv
+        ...
+        └── 数据集n名称
+    
+预处理文件配置说明
+----
+所有预处理方法的文件，均存放在./preprocess路径下，并写入__init__文件。
+每个预处理方法单独建立一个文件，新建预处理方法的命名全部采用英文和数字组成，文件名称直接使用预处理方法的命名。
+
+文件内需包含MyDataset类，MyDataset类继承torch.utils.data.Dataset，必须定义__len__、__getitem__两个方法。
+各个方法的具体设计要求，详见./preprocess/template.py文件。
+
+模型文件配置说明
+-----
+所有模型结构的文件，需存放于./method路径下，并写入__init__文件。
+每个模型单独建立一个文件，新建模型的命名全部采用英文和数字组成，文件名称直接使用模型的命名。
+
+文件内需包含Model类，Model类继承nn.Module，必须定义forward方法、loss方法和detection方法。
+各个方法的具体设计要求，详见./method/template.py文件。
+
+评估方法文件配置说明
+-----
+所有评估方法的文件，需存放于./evaluation路径下，并写入__init__文件。
+每个评估方法单独建立一个文件，新建评估方法的命名全部采用英文和数字组成，文件名称直接使用模型的命名。
+
+文件内需定义evaluate方法，方法的输入输出设计详见./evaluation/template.py文件。
 
 
-
-## Getting started
-
-To make it easy for you to get started with GitLab, here's a list of recommended next steps.
-
-Already a pro? Just edit this README.md and make it your own. Want to make it easy? [Use the template at the bottom](#editing-this-readme)!
-
-## Add your files
-
-- [ ] [Create](https://docs.gitlab.com/ee/user/project/repository/web_editor.html#create-a-file) or [upload](https://docs.gitlab.com/ee/user/project/repository/web_editor.html#upload-a-file) files
-- [ ] [Add files using the command line](https://docs.gitlab.com/ee/gitlab-basics/add-file.html#add-a-file-using-the-command-line) or push an existing Git repository with the following command:
-
-```
-cd existing_repo
-git remote add origin https://git.mesalab.cn/zyq/time_series_anomaly_detection.git
-git branch -M main
-git push -uf origin main
-```
-
-## Integrate with your tools
-
-- [ ] [Set up project integrations](https://git.mesalab.cn/zyq/time_series_anomaly_detection/-/settings/integrations)
-
-## Collaborate with your team
-
-- [ ] [Invite team members and collaborators](https://docs.gitlab.com/ee/user/project/members/)
-- [ ] [Create a new merge request](https://docs.gitlab.com/ee/user/project/merge_requests/creating_merge_requests.html)
-- [ ] [Automatically close issues from merge requests](https://docs.gitlab.com/ee/user/project/issues/managing_issues.html#closing-issues-automatically)
-- [ ] [Enable merge request approvals](https://docs.gitlab.com/ee/user/project/merge_requests/approvals/)
-- [ ] [Automatically merge when pipeline succeeds](https://docs.gitlab.com/ee/user/project/merge_requests/merge_when_pipeline_succeeds.html)
-
-## Test and Deploy
-
-Use the built-in continuous integration in GitLab.
-
-- [ ] [Get started with GitLab CI/CD](https://docs.gitlab.com/ee/ci/quick_start/index.html)
-- [ ] [Analyze your code for known vulnerabilities with Static Application Security Testing(SAST)](https://docs.gitlab.com/ee/user/application_security/sast/)
-- [ ] [Deploy to Kubernetes, Amazon EC2, or Amazon ECS using Auto Deploy](https://docs.gitlab.com/ee/topics/autodevops/requirements.html)
-- [ ] [Use pull-based deployments for improved Kubernetes management](https://docs.gitlab.com/ee/user/clusters/agent/)
-- [ ] [Set up protected environments](https://docs.gitlab.com/ee/ci/environments/protected_environments.html)
-
-***
-
-# Editing this README
-
-When you're ready to make this README your own, just edit this file and use the handy template below (or feel free to structure it however you want - this is just a starting point!). Thank you to [makeareadme.com](https://www.makeareadme.com/) for this template.
-
-## Suggestions for a good README
-Every project is different, so consider which of these sections apply to yours. The sections used in the template are suggestions for most open source projects. Also keep in mind that while a README can be too long and detailed, too long is better than too short. If you think your README is too long, consider utilizing another form of documentation rather than cutting out information.
-
-## Name
-Choose a self-explaining name for your project.
-
-## Description
-Let people know what your project can do specifically. Provide context and add a link to any reference visitors might be unfamiliar with. A list of Features or a Background subsection can also be added here. If there are alternatives to your project, this is a good place to list differentiating factors.
-
-## Badges
-On some READMEs, you may see small images that convey metadata, such as whether or not all the tests are passing for the project. You can use Shields to add some to your README. Many services also have instructions for adding a badge.
-
-## Visuals
-Depending on what you are making, it can be a good idea to include screenshots or even a video (you'll frequently see GIFs rather than actual videos). Tools like ttygif can help, but check out Asciinema for a more sophisticated method.
-
-## Installation
-Within a particular ecosystem, there may be a common way of installing things, such as using Yarn, NuGet, or Homebrew. However, consider the possibility that whoever is reading your README is a novice and would like more guidance. Listing specific steps helps remove ambiguity and gets people to using your project as quickly as possible. If it only runs in a specific context like a particular programming language version or operating system or has dependencies that have to be installed manually, also add a Requirements subsection.
-
-## Usage
-Use examples liberally, and show the expected output if you can. It's helpful to have inline the smallest example of usage that you can demonstrate, while providing links to more sophisticated examples if they are too long to reasonably include in the README.
-
-## Support
-Tell people where they can go to for help. It can be any combination of an issue tracker, a chat room, an email address, etc.
-
-## Roadmap
-If you have ideas for releases in the future, it is a good idea to list them in the README.
-
-## Contributing
-State if you are open to contributions and what your requirements are for accepting them.
-
-For people who want to make changes to your project, it's helpful to have some documentation on how to get started. Perhaps there is a script that they should run or some environment variables that they need to set. Make these steps explicit. These instructions could also be useful to your future self.
-
-You can also document commands to lint the code or run tests. These steps help to ensure high code quality and reduce the likelihood that the changes inadvertently break something. Having instructions for running tests is especially helpful if it requires external setup, such as starting a Selenium server for testing in a browser.
-
-## Authors and acknowledgment
-Show your appreciation to those who have contributed to the project.
-
-## License
-For open source projects, say how it is licensed.
-
-## Project status
-If you have run out of energy or time for your project, put a note at the top of the README saying that development has slowed down or stopped completely. Someone may choose to fork your project or volunteer to step in as a maintainer or owner, allowing your project to keep going. You can also make an explicit request for maintainers.
author	ZHENG Yanqin <[email protected]>	2023-05-25 07:37:53 +0000
committer	ZHENG Yanqin <[email protected]>	2023-05-25 07:37:53 +0000
commit	e9896bd62bb29da00ec00a121374167ad91bfe47 (patch)
tree	d94845574c8ef7473d0204d28b4efd4038035463 /README.md
parent	fad9aa875c84b38cbb5a6010e104922b1eea7291 (diff)
parent	4c5734c624705449c6b21c4b2bc5554e7259fdba (diff)