Update README.md with Gretel's Synthetic Multilingual Prompts Dataset

Inspired by the valuable work in this repository and dataset, Gretel has created a synthetic multilingual dataset of prompts. This dataset, released under the permissive Apache 2.0 license, includes 1,250 synthetic LLM prompts generated using Gretel Navigator, available in seven different languages. It is designed to be used with LLMs to generate diverse and multilingual responses based on the provided prompts. We believe this contribution will further enhance and support the efforts made here by providing additional resources for the community to explore, utilize, and build upon.
mvansegbroeck 2024-06-12 13:38:55 +02:00 committed by GitHub
parent d56ff0e90f
commit 20f405042c
1 changed file with 11 additions and 0 deletions

@@ -96,6 +96,17 @@ The _unofficial_ ChatGPT desktop application provides a convenient way to access
# Prompts
## Synthetic Multilingual LLM Prompts
Contributed by: [Gretel](https://gretel.ai)
Reference: [https://huggingface.co/datasets/gretelai/synthetic_multilingual_llm_prompts](https://huggingface.co/datasets/gretelai/synthetic_multilingual_llm_prompts)
- The "Synthetic Multilingual LLM Prompts" dataset features 1,250 synthetic LLM prompts generated using [Gretel Navigator](https://gretel.ai/navigator), available in seven different languages.
- To ensure accuracy and diversity and to maintain translation quality and consistency, we employed Gretel Navigator both as a generation tool and as the judge in an LLM-as-a-judge approach. More info can be found in the [README](https://huggingface.co/datasets/gretelai/synthetic_multilingual_llm_prompts).
- This dataset is designed to be used with LLMs to generate diverse and multilingual responses based on the provided prompts.
- The dataset includes prompts in English, Dutch, French, Spanish, German, Brazilian Portuguese, and Simplified Chinese. Detailed evaluations of translation quality are available for each language.
- This dataset is released under the Apache 2.0 license, making it open for public use with proper attribution. We invite the community to explore, utilize, and contribute to this dataset to enhance the versatility and richness of LLM interactions.
- Disclaimer: This dataset, including its translations, is generated synthetically and has not been perfected by human review. As a result, inaccuracies may be present.
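
As a rough illustration of how the prompts described above might be used, the following Python sketch groups prompts by language before sending them to an LLM. It is only a sketch under stated assumptions: the `train` split name and the `language` and `prompt` field names are hypothetical placeholders that are not confirmed here, so check the dataset card for the real schema. The helper names `load_prompt_rows` and `group_by_language` are made up for this example, and the sample rows are stand-ins, not records from the dataset.

```python
from collections import defaultdict


def load_prompt_rows(split="train"):
    """Download the dataset from the Hugging Face Hub (needs network access).

    Assumption: the data is exposed under a "train" split.
    """
    from datasets import load_dataset  # pip install datasets

    return load_dataset("gretelai/synthetic_multilingual_llm_prompts", split=split)


def group_by_language(rows, language_field="language", prompt_field="prompt"):
    """Group prompt texts by language.

    The field names are hypothetical placeholders; check the dataset card
    for the actual column names before using this on the real data.
    """
    grouped = defaultdict(list)
    for row in rows:
        grouped[row[language_field]].append(row[prompt_field])
    return dict(grouped)


# Stand-in rows so the sketch runs offline; these are not real dataset records.
sample_rows = [
    {"language": "English", "prompt": "Summarize this article."},
    {"language": "Dutch", "prompt": "Vat dit artikel samen."},
    {"language": "English", "prompt": "Write a haiku about rain."},
]

for language, prompts in group_by_language(sample_rows).items():
    print(f"{language}: {len(prompts)} prompt(s)")
```

To try this on the real data, pass the result of `load_prompt_rows()` to `group_by_language` and adjust the two field-name arguments to match the columns listed on the dataset card.
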
## ChatGPT SEO prompts
Contributed by: [StoryChief AI](https://www.storychief.io/ai-power-mode)
Reference: [https://storychief.io/blog/chatgpt-prompts-seo](https://storychief.io/blog/chatgpt-prompts-seo)