
orbiter/bundestag_gesetze_index_bulk_20240507

The dataset Deutsche Bundesgesetze und -verordnungen (German federal laws and regulations) contains index files in Elasticsearch bulk format. The files were generated with the bundestag_gesetze_parser tool from source data at https://www.gesetze-im-internet.de/. The dataset is primarily intended as a retrieval corpus for Retrieval-Augmented Generation (RAG) with large language models (LLMs). The README also gives detailed steps for importing the data into Elasticsearch and YaCy.

Updated 5/14/2024
hugging_face

Description

Dataset Overview

Basic Information

  • License: cc0-1.0
  • Task Category: Text Generation
  • Language: German
  • Tags: Legal
  • Size Category: 100K<n<1M

Dataset Content

Use Cases

  • Serves as a retrieval corpus for Retrieval-Augmented Generation (RAG) with large language models.
  • Elasticsearch is recommended for full-text indexing or embedding-based semantic indexing.
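
As a rough illustration of the RAG use case, the sketch below builds a full-text search body for Elasticsearch and turns the returned hits into a prompt for an LLM. The field name `text`, the helper names, and the prompt layout are assumptions made for illustration; the actual field names depend on how bundestag_gesetze_parser structures the documents and should be checked against the index mapping.

```python
def build_match_query(question, field="text", size=5):
    """Build a search body for Elasticsearch's _search endpoint.

    `field` is a hypothetical field name; replace it with the field
    that actually holds the legal text in the imported index.
    """
    return {"size": size, "query": {"match": {field: question}}}


def build_prompt(question, hits, field="text"):
    """Join retrieved passages into a numbered context block for an LLM.

    `hits` is expected to look like the `hits.hits` list of a search
    response, i.e. dicts that carry the document under `_source`.
    """
    passages = [hit["_source"][field] for hit in hits]
    context = "\n\n".join(f"[{i + 1}] {p}" for i, p in enumerate(passages))
    return f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"
```

The query body would be sent to the `_search` endpoint of the index, and the resulting prompt passed to whichever LLM is in use. An embedding-based setup would replace the `match` query with a vector search but could keep the same prompt-building step.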

Dataset Import

  • Elasticsearch import: start an Elasticsearch container with Docker, then load the bulk files with a series of curl commands.
  • YaCy import: the data can also be imported into YaCy; the detailed steps are described on the YaCy forum.
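
The Elasticsearch import can also be scripted instead of run through curl. The following is a minimal sketch, not the README's procedure: it assumes the bulk files consist only of two-line action/document pairs (as produced by `index` actions), and that an Elasticsearch instance without authentication is reachable at `http://localhost:9200`. The function names and the chunk size are hypothetical.

```python
import urllib.request


def iter_bulk_chunks(lines, pairs_per_chunk=500):
    """Group bulk-format lines into payloads of whole action/document pairs.

    Assumes every action line is followed by exactly one document line.
    Each payload ends with a newline, as the _bulk endpoint requires.
    """
    buffer = []
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue  # ignore blank lines between records
        buffer.append(line)
        if len(buffer) == 2 * pairs_per_chunk:
            yield "\n".join(buffer) + "\n"
            buffer = []
    if buffer:
        yield "\n".join(buffer) + "\n"


def post_bulk(payload, base_url="http://localhost:9200"):
    """POST one payload to the _bulk endpoint and return the raw response."""
    request = urllib.request.Request(
        f"{base_url}/_bulk",
        data=payload.encode("utf-8"),
        headers={"Content-Type": "application/x-ndjson"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return response.read().decode("utf-8")
```

To use it, open one of the downloaded bulk files in text mode with UTF-8 encoding, pass the file object to `iter_bulk_chunks`, and call `post_bulk` on each payload. Splitting into chunks keeps each request well below Elasticsearch's request size limit; the response of each call should be checked for its `errors` flag.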

Topics

Legal Data
Data Indexing

Source

Organization: hugging_face

Created: Unknown
