ML: Datasets & Models

Category: Machine Learning Resources
Context: This page serves as a centralized hub for datasets and models developed for NLP research, specifically in the domains of complex reasoning and literary analysis.

Core Datasets

1. ExistenceTypes (Complex Reasoning)

A specialized dataset focused on multi-step reasoning and contextual understanding. It explores the distinction between godless and godliving entities through logic puzzles and classification tasks.

Size: 269 samples
Primary Task: Text Classification (Godless/Godliving/Mixed)
Secondary Tasks: Question Answering, Thematic Validation
Access: HuggingFace Dataset
Associated Model: ExistenceTypes Analysis Model (90+ implementations)

2. PoeticDevices (Creative Writing)

A comprehensive collection of poetic devices and their applications, featuring annotated examples across various literary techniques.

Size: 1,796 samples
Categories: 33 poetic devices
Themes: 36 distinct types
Access: HuggingFace Dataset

3. GreekMythos

Provides structured data for mythological analysis and narrative understanding, focusing on character relationships and thematic patterns.

Type: Text Analysis
Focus: Mythological Studies
Access: HuggingFace Dataset

4. Xibirisms

A linguistic dataset exploring unique language patterns and expressions, derived from the Xibirisms poetry-prose epic.

Type: Linguistic Analysis
Format: Parquet
Access: HuggingFace Dataset

Technical Applications

Our datasets are designed to work together, providing a comprehensive framework for natural language understanding across multiple domains. By bridging the gap between mythology, linguistics, and logical reasoning, these resources enable the development of more nuanced and context-aware AI models.

Licensing

All datasets listed here are licensed under CC BY-NC-SA 4.0. They are free for research and non-commercial use with appropriate attribution.