基本的には~がユーザープロファイルディクトリ。
PS> $env:USERPROFILE で表示 (ref).
%USERPROFILE%ってexplorerに打つとユーザープロファイルディレクトリへ飛べる

2020-06-01

fireLens

Docker logging driverの出力

output: structured log¹

{
  container_id: "id",
  container_name: "name",
  source: "stdout" | "stderr",
  log: "containerLog",
}

FireLens fluentd/bitの出力

firelensで firelensConfiguration.options.enable-ecs-log-metadata = true の場合

{
  ...dockerStructureLog,
  ecs_cluster: "クラスター名",
  ecs_task_arn: "タスクARN",
  ecs_task_definition: "タスク定義名とリビジョン",
}

applicationログを使いたい場合

structured container logをstdout/stderrに吐けばいい.
形式は要相談だけど、jsonにはする.

Tagを指定できないinput

The only input plugin that don't assign Tags is Forward input.
This plugin speaks the Fluentd wire protocol called Forward where every Event already comes with a Tag associated.
Fluent Bit will always use the incoming Tag set by the client.
ref

Source events can have or not have a structure. A structure defines a set of keys and values inside the Event message. ref ↩

2020-05-21

FPS/APEX戦術論

投げ物のダメージ量

敵の密集度に比例する.
3人が固まってたら威力3倍

狭所

必要視野角

2020-05-19

上手い2つのシステム論

標準化
オーダーメイド

上手く回るシステム

A: 標準化

良い手順 (プロセス) を開発して明示し、実行主体に依存しないシステムとする
ソフトウェアで実行してもよし、正社員で実行してもよし、バイトで実行してもよし。
実行主体のリソース限界からくる制限を小さくしやすいため、スケーリングが容易。安定性も高い。

B: オーダーメイド

目的を達成するために実行主体群の能力を最大限引き出せるワンオフ・オーダーメイドの組み合わせで回すシステム。
実行主体群の能力を最大限に引き出しているので、リソースの利用効率が最大。ゆえに実行主体群が超高額だったりしない限り、総合パフォーマンスは高い.

課題

A: 標準化

標準化すべき「上手く回るシステム」をそもそもどうやって作るのか。
標準化の過程は標準化できるのか。

B: オーダーメイド

簡単にオーダーメイドできたら苦労しない。天才軍師が必要。

参考文献

なれるSE13 ~徹底指南？新人研修~

2020-05-08

Scyclone VC

科学

CycleGAN + linear spectrogram + WareRNN Vocoder => similarity MOS 4.5, naturalness MOS 3後半

[わかる人向け記事]

Overview

Masaya Tanaka, Takashi Nose, Aoi Kanagaki, Ryohei Shimizu, and Akira Ito (2020) Scyclone: High-Quality and Parallel-Data-Free Voice Conversion Using Spectrogram and Cycle-Consistent Adversarial Networks.

linear-Spec conversion with CycleGAN + simplified-WaveRNN Vocoder
Similarity is super good (demo)

prior works

CycleGAN-VC (2 groups)
CycleGAN-VC2
WaveNet family (especially WaveRNN)

Philosophies

prefer E2E: Vocoding error
prefer unified conversion: feature-feature correlation
strong Discriminator cause problems
Encoder-Decoder structure may lose linguistic info & time structure ¹

Why not mel-spec

Better practically in WaveRNN-based vocoder

the low-dimensional linear spectrogram gives a better result than the mel spectrogram as the input of the following WaveRNN-based vocoder

Setups

architecture
- spec2spec: CycleGAN
  - input: 1D spec (channel = frequency)
  - losses: hinge adversarial, L1 cycle-consistency, L1 identity
  - commons
    - network: 1D conv ResNet
      - no EncDoc structure (stride=1)
      - first layer: channel upsampling (channel doubling with pointwiseConv)
    - acticavation: LReLU
  - G specific
    - last layer: channel downsampling (channel half-nize with pointwiseConv)
  - D specific
    - normalization: SN
    - final layer: channel downsampling + global average pooling (to 1 channel with pointwiseConv => pooling)
    - additional input noise: N(0, 0.01)
- spec2wave: simplified-WaveRNN
  - Gaussian probability density function
training
- data
  - format
    - 16 kHz
    - linear spec
      - 254 Hanning window (~16msec)
      - 128-point shift (1/2 slide)
    - size: 160 frames (~1.3 sec)
      - for D, head/tail each 16 frames discarded
  - datum
    - Ayanami: 4,973 utterances
    - F009: 4,973 utterances
- params
  - Conv kernel: (5,)
  - ResNet layerNum: n_G=7, n_D=6
  - m_hinge: 0.5
  - λ_{cycle_consistency}: 10
  - λ_identity: 1
  - Adam_cycleGAN: (α, β1, β2) = (2.0 × 10−4, 0.5, 0.999)
  - Adam_WaveRNN: (α, β1, β2) = (1.0 × 10−4, 0.5, 0.999)
  - batch size: 64 & 160 (cycleGAN, WaveRNN)
evaluation
- subjective metrics (9 JP listener)
  - naturalness MOS
  - similarity MOS

Results

naturalness MOS: 3.9 (vs4.6) & 3.4 (vs4.8)
similarity MOS: 4.4 & 4.5

Note for implementation

Dimension

G: (input) 160x1x128 => (channelx2) => 160x1x256 => (ResNetLoop) => 160x1x256 => (channel 1/2) => 160x1x128
D: (input) 160x1x128 => (head/tail cut) => 128x1x128 => (channelx2) => 128x1x256 => (ResNetLoop) => 128x1x256 => (channel 1) => 128x1x1 => (global average pooling) => 1
V: (input) 160x1x128 =>

8 frames (~64msec) => 8192 units => (reshape) => 128 sampling point * 64-dim vector

" More detailed description and evaluation of Scylone will be presented in our next article." from Scyclone paper

在野情報

Google検索

有用な情報が無

Twitter

発表では女性間のみの評価ですが、男性→女性変換も問題なくできます。また、高速化したWaveRNNによりリアルタイム変換も一応動いているのでそのうちご紹介できればと。
tweet

from poster in tweet:

frame数がG240/D240になってるけど論文と違う.
- 論文だとG160/D128
iter==400K
scheduler: x0.1 per 100K

ノンパラレル声質変換手法Scyclone発表のポスターです。なお発表では女性間のみの評価ですが、男性→女性変換も問題なくできます。また、高速化したWaveRNNによりリアルタイム変換も一応動いているのでそのうちご紹介できればと。 https://t.co/cTzbpX7Pct pic.twitter.com/cpm2KFTsyv
— 能勢隆 (Takashi Nose) (@takashi_nose) 2020年9月14日

My paper read

totally enough except for chapter Ⅲ, which is roughly read (I will use other methods)

check PPG
check "simplified WaveRNN with a single Gaussian probability density function"
check [16] for hinge loss
check [18], info destroy by EncDec
noise effect: input of the discriminator to fix the instability and vanishing gradients issues [21].

My result

My reimplementation (now private)

preliminary: P100で2.37iter/sec（Google Colab Tesla P100-PCIE-16GB）。なので400K iterには46h.
CycleGAN-VC1とそこまで極端には変わらない感触（モデルサイズ的にも妥当だと思われ）

“we think that such high-level abstraction increases the risk of destroying linguistic information and the time structure of input speech.” from Scyclone paper↩

2020-05-01

システム運用実装@2020

システム運用は多岐に渡る。その実装例@2020.

運用のsteps

システム運用@wikipedia

実装

障害対応
- 監視
  - ロギング: fluentd/fluentBit stdout in_forward
  - 障害検知: fluentd/fluentBit filter
  - 集約: slack
- 通知: slack notification
- 復旧
  - 自動修復: ECS task restart
- 縮退運転/fallback: -

2020-05-01

AWS FireLens (ECS logging sidecar)

実装: Fluentd/bitの自動設定挿入

officials
- GitHub - Amazon ECS FireLens: Under the Hood
- AWS - Under the hood: FireLens for Amazon ECS Tasks

たれぱんのびぼーろく

わたしの備忘録、生物学とプログラミングが多いかも

ユーザープロファイル

ディレクトリ

Docker =>fluentd/fluentBit/fireLens

Docker logging driverの出力

FireLens fluentd/bitの出力

applicationログを使いたい場合

Tagを指定できないinput

FPS/APEX戦術論

投げ物のダメージ量

狭所

上手い2つのシステム論

上手く回るシステム

A: 標準化

B: オーダーメイド

課題

A: 標準化

B: オーダーメイド

参考文献

Scyclone VC

Overview

prior works

Philosophies

Why not mel-spec

Setups

Results

Note for implementation

Dimension

在野情報

Google検索

Twitter

My paper read

My result

システム運用実装@2020

運用のsteps

実装

AWS FireLens (ECS logging sidecar)