Qlean Dataset Begins Offering the "Japanese Single-Speaker Kodan Speech Corpus with Transcripts"

2026.02.20 09:00

PR TIMES

Visual Bank Inc.
GENIAC-selected Visual Bank supports speech and language AI research with audio and text data from the traditional storytelling arts

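Visual Bank Inc. (Minato-ku, Tokyo; Representative Director and CEO: Saneyuki Nagai) has begun offering the "Japanese Single-Speaker Kodan Speech Corpus with Transcripts" through Qlean Dataset, the AI training data solution operated by its subsidiary Amana Images Inc. The dataset is intended for the development and study of speech and language AI, including ASR (automatic speech recognition), speech understanding, and speech language models.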

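The dataset covers the narration of kodan, a traditional Japanese storytelling art, and consists of audio of a single speaker telling stories together with Japanese transcripts that faithfully transcribe what was spoken. It contains continuously recorded natural speech with the intonation, pauses, and changes in speaking rate characteristic of kodan, which gives it a Japanese narrative speech structure that differs from read-aloud or conversational audio.
Because the narration includes scene description, differentiated voicing of characters, and the staging of tension as the story progresses, the dataset offers a setting for verifying the correspondence between the audio signal and its textual representation that monotonous speech data cannot provide. Its range of narration formats, from long to short, also makes it usable for research involving context retention and segmentation in continuous speech.
Qlean Dataset provides this dataset after clearing rights and organizing the terms of use, based on the data requirements of research and commercial AI development, including generative AI foundation model development. Visual Bank will continue to support the foundations of AI development and research by building out diverse Japanese-language data in the speech and language domain.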

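Overview of the newly available "Japanese Single-Speaker Kodan Speech Corpus with Transcripts"

Example use cases for the "Japanese Single-Speaker Kodan Speech Corpus with Transcripts"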

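[Research Applications]

- Verifying natural-speech accuracy in Japanese speech recognition models: In ASR model research, the continuous speech of kodan narration, with its intonation and pauses, can be used to verify recognition accuracy and misrecognition tendencies under natural-speech conditions that differ from read-aloud audio.
- Research on the correspondence between speech and linguistic expression: By combining the audio signal with the transcribed text, researchers can analyze how the structure of narrative expression and prosodic information in Japanese affect language understanding.

[Industry Applications]

- Verifying long-form audio processing in voice-input AI: In the development of AI products for voice search or audio archive analysis, long single-speaker narration can be used to verify functions such as audio segmentation, full transcription, and summarization.
- Pre-training and evaluation of Japanese speech language models: As audio and text data containing distinctively Japanese narrative style and story structure, the dataset can serve as supplementary data in the pre-training and evaluation phases of speech language models.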

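About Qlean Dataset
Qlean Dataset is a commercially usable AI training data solution provided by Amana Images Inc., a subsidiary of Visual Bank.
It handles data in many formats, including images, video, audio, 3D, and text, and maintains an environment in which the data can be used safely for both research and commercial purposes.
Through collaboration with data partners such as Chiba Lotte Marines Co., Ltd. and Toyo Keizai Inc., it also continues to expand "AI Data Recipe," a data lineup that is industry-specific and tracks the latest trends.
Qlean Dataset reduces the burden of data collection and preparation in AI development and supports the building of rights-cleared AI development environments free of legal risk.
▶ Qlean Dataset site: https://qleandataset.visual-bank.co.jp/
▶ AI Data Recipe: https://qleandataset.visual-bank.co.jp/lineup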

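Features of "AI Data Recipe," the datasets offered by Qlean Dataset
- Consent obtained from all subjects
- Existing data can be delivered in as little as one day
- Custom shooting, recording, and collection to build original datasets is also supported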

Contact

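Visual Bank Inc.
Visual Bank is a startup that builds and provides next-generation data infrastructure to maximize AI development capability, operating under the mission "Unlock the potential of all data." In addition to offering THE PEN, an AI-assisted tool that supports manga artists' wish to "draw more!", it wholly owns Amana Images Inc., which provides the AI training dataset development service Qlean Dataset.
Visual Bank has also been selected for GENIAC, a national research and development program, and is accelerating its efforts toward real-world deployment.
Representative Director and CEO: Saneyuki Nagai
Address: 6F, C-Cube Minami-Aoyama Building, 7-1-7 Minami-Aoyama, Minato-ku, Tokyo 107-0062
Visual Bank corporate URL: https://visual-bank.co.jp/
Amana Images corporate URL: https://amanaimages.com/about/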

Qlean Dataset Launches Japanese Single-Speaker Kodan Speech Corpus with Transcripts
Traditional Narrative Performance Audio and Text Data for ASR, Speech Understanding, and Language Model Research
Visual Bank Inc. (Minato-ku, Tokyo; CEO: Saneyuki Nagai), through its subsidiary Amana Images Inc., has launched a new dataset under its AI training data solution, Qlean Dataset: the Japanese Single-Speaker Kodan Speech Corpus with Transcripts. The dataset is designed for speech- and language-based AI development and research, including Automatic Speech Recognition (ASR), speech understanding, and speech-language modeling.
This dataset consists of audio recordings of Kodan, a traditional Japanese narrative storytelling performance, delivered by a single speaker. Each recording is paired with a Japanese transcript that faithfully reflects the spoken content. The corpus captures continuous natural speech that includes expressive intonation, pauses, and variations in speaking rate characteristic of Kodan performance. Unlike read-aloud or conversational datasets, this corpus contains narrative speech structures specific to Japanese storytelling.
Because Kodan narration incorporates scene descriptions, character differentiation, and dramatic tension as the story unfolds, the dataset provides a structured environment for examining the alignment between acoustic signals and textual expression. It enables evaluation under expressive, non-monotonous speech conditions that differ from standard scripted reading datasets. With recordings ranging from short to long form, the corpus also supports research on contextual retention and segmentation in continuous speech processing.
In response to data requirements in both research and commercial AI development, including foundation model development, Qlean Dataset provides this corpus with clarified usage rights and licensing conditions. Visual Bank will continue expanding structured Japanese-language datasets in the speech and language domain to support the foundation of AI development and research.

Overview of the Japanese Single-Speaker Kodan Speech Corpus with Transcripts

Use Case Examples

Research Applications

- Evaluation of Natural Speech Recognition Accuracy in Japanese ASR Models: The corpus enables evaluation of ASR models using continuous narrative speech that includes expressive intonation and pauses, allowing analysis of recognition accuracy and error patterns under conditions that differ from standard read speech.
- Research on the Relationship Between Acoustic and Linguistic Representation: By combining audio signals with their paired transcripts, researchers can analyze how narrative structure and prosodic features in Japanese influence language understanding.
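
One way to quantify the recognition accuracy and error patterns mentioned in the first use case is character error rate (CER), a metric commonly used for Japanese ASR because written Japanese has no whitespace word boundaries. The following is a minimal sketch in Python and is not part of the dataset or of any Qlean Dataset tooling; the function names and the sample strings are hypothetical and chosen only for illustration.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences, using two DP rows."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def character_error_rate(reference, hypothesis):
    """CER = edit distance / reference length, ignoring whitespace."""
    ref = [c for c in reference if not c.isspace()]
    hyp = [c for c in hypothesis if not c.isspace()]
    if not ref:
        raise ValueError("reference transcript is empty")
    return edit_distance(ref, hyp) / len(ref)


# Hypothetical transcript line and ASR output (not taken from the corpus).
reference = "むかしむかしあるところに"
hypothesis = "むかしむかしあるとこに"  # one character dropped by the recognizer
print(f"CER: {character_error_rate(reference, hypothesis):.3f}")
```

In practice the reference and hypothesis would usually be normalized first (for example, punctuation and character-width handling), and that normalization policy should be reported alongside the score.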

Industry Applications

- Validation of Long-Form Speech Processing in Voice-Enabled AI Systems: For AI products involving voice search or audio archive analysis, the dataset can be used to validate speech segmentation, full transcription, and summarization performance using extended monologue recordings.
- Pre-training and Evaluation of Japanese Speech-Language Models: As a dataset containing narrative speech structures specific to Japanese, it can serve as supplementary data for pre-training or evaluation phases of speech-language models.
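
As an illustration of the speech segmentation step mentioned in the long-form use case, the sketch below splits a recording at long pauses using a simple frame-energy threshold. It is a simplified baseline under stated assumptions, not a method described by Qlean Dataset: the function name, the threshold, and the frame and pause durations are all hypothetical, and the input is assumed to be mono samples already decoded to floats in the range -1.0 to 1.0.

```python
def split_on_pauses(samples, sample_rate, frame_ms=20,
                    threshold=0.01, min_pause_ms=300):
    """Return (start_sec, end_sec) speech spans separated by long pauses."""
    frame_len = max(1, int(sample_rate * frame_ms / 1000))
    min_pause_frames = max(1, int(min_pause_ms / frame_ms))

    # Mark each frame as voiced when its RMS amplitude exceeds the threshold.
    voiced = []
    for start in range(0, len(samples), frame_len):
        frame = samples[start:start + frame_len]
        rms = (sum(x * x for x in frame) / len(frame)) ** 0.5
        voiced.append(rms > threshold)

    segments = []
    seg_start = None    # first frame index of the current segment
    last_voiced = None  # most recent voiced frame index
    for i, is_voiced in enumerate(voiced):
        if is_voiced:
            if seg_start is None:
                seg_start = i
            last_voiced = i
        elif seg_start is not None and i - last_voiced >= min_pause_frames:
            # The silence has lasted long enough: close the segment.
            segments.append((seg_start, last_voiced + 1))
            seg_start = None
    if seg_start is not None:
        segments.append((seg_start, last_voiced + 1))

    frame_sec = frame_len / sample_rate
    return [(s * frame_sec, e * frame_sec) for s, e in segments]


# Synthetic signal: 0.2 s of "speech", 0.4 s of silence, 0.2 s of "speech".
signal = [0.5] * 200 + [0.0] * 400 + [0.5] * 200
for start, end in split_on_pauses(signal, sample_rate=1000):
    print(f"{start:.2f}s to {end:.2f}s")
```

Expressive narration uses deliberate pauses for dramatic effect, so a fixed pause threshold like this one may split mid-sentence; comparing such a baseline against the transcript boundaries is one way the corpus could be used for validation.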

About Qlean Dataset
Qlean Dataset is a commercial-use-ready AI training data solution provided by Amana Images Inc., a subsidiary of Visual Bank Inc.
It supports a wide range of data types, including images, videos, audio, 3D assets, and text, enabling both research and commercial AI development in a legally safe environment.
Through collaborations with data partners such as Chiba Lotte Marines Co., Ltd. and Toyo Keizai Inc., Qlean Dataset continues to expand its specialized, industry-focused lineup known as the “AI Data Recipe.”
By reducing the operational burden of data collection and preparation, Qlean Dataset helps organizations establish AI development environments that are both legally compliant and risk-free.
▶ Qlean Dataset: https://qleandataset.visual-bank.co.jp/en
▶ AI Data Recipe: https://qleandataset.visual-bank.co.jp/en/lineup

Key Features of Qlean Dataset
- Consent obtained from all subjects
- Existing datasets deliverable in as little as one day
- Custom data collection and recording services available

▶ Contact: https://qleandataset.visual-bank.co.jp/en/contact

About Visual Bank Inc.
Visual Bank Inc. is a Tokyo-based startup building next-generation data infrastructure to enhance AI development capabilities under the mission “Unlocking Data Accessibility.”
The company operates THE PEN, an AI-assisted creative tool for manga artists, and the Qlean Dataset service.
Its subsidiaries include Amana Images Inc., one of Japan’s largest photostock providers, which provides Qlean Dataset and leads research and development in AI data, and THE PEN Inc., which offers an AI-assisted creative tool for manga artists.
CEO: Saneyuki Nagai
Address: 6F, C-Cube Minami Aoyama Building, 7-1-7 Minami-Aoyama, Minato-ku, Tokyo 107-0062
Corporate Site: https://visual-bank.co.jp/en
Amana Images: https://qleandataset.visual-bank.co.jp/en/company-overview