ML: Datasets & Models
Category: Machine Learning Resources
Context: This page serves as a centralized hub for datasets and models developed for NLP research, specifically in the domains of complex reasoning and literary analysis.
Core Datasets
1. ExistenceTypes (Complex Reasoning)
A specialized dataset focused on multi-step reasoning and contextual understanding. It explores the distinction between godless and godliving entities through logic puzzles and classification tasks.
- Size: 269 samples
- Primary Task: Text Classification (Godless/Godliving/Mixed)
- Secondary Tasks: Question Answering, Thematic Validation
- Access: HuggingFace Dataset
- Associated Model: ExistenceTypes Analysis Model (90+ implementations)
2. PoeticDevices (Creative Writing)
A comprehensive collection of poetic devices and their applications, featuring annotated examples across various literary techniques.
- Size: 1,796 samples
- Categories: 33 poetic devices
- Themes: 36 distinct types
- Access: HuggingFace Dataset
3. GreekMythos
Provides structured data for mythological analysis and narrative understanding, focusing on character relationships and thematic patterns.
- Type: Text Analysis
- Focus: Mythological Studies
- Access: HuggingFace Dataset
4. Xibirisms
A linguistic dataset exploring unique language patterns and expressions, derived from the Xibirisms poetry-prose epic.
- Type: Linguistic Analysis
- Format: Parquet
- Access: HuggingFace Dataset
Technical Applications
Our datasets are designed to work together, providing a comprehensive framework for natural language understanding across multiple domains. By bridging the gap between mythology, linguistics, and logical reasoning, these resources enable the development of more nuanced and context-aware AI models.
Licensing
All datasets listed here are licensed under CC BY-NC-SA 4.0. They are free for research and non-commercial use with appropriate attribution.