Effective Date: 2025-11-19
Version: 1.0.0
Status: Active
This document defines the governance, standards, and policies for the Cognitive Archaeology Tribunal project and all datasets published using it.
Every output JSON file MUST include:
{
"metadata": {
"id": "uuid:xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx",
"type": "ai-conversations|archives|bookmarks|browser-tabs|synthesis",
"version": "1.0.0",
"schema_version": "1.0.0",
"created_at": "2025-11-19T12:00:00Z",
"updated_at": "2025-11-19T12:00:00Z",
"generator": {
"tool": "cognitive-tribunal",
"version": "0.2.0",
"module": "AIContextAggregator"
},
"provenance": {
"source_files": ["path/to/source1.json"],
"processing_steps": ["load", "parse", "analyze"],
"parameters": {}
},
"license": "CC0-1.0",
"attribution": "Optional attribution text"
},
"data": {
// Actual output data
}
}
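The metadata requirement above can be checked mechanically before a file is published. The following is a minimal sketch, not part of the project's tooling: the function name `validate_output` and the choice of which keys count as required are assumptions made for illustration (`attribution` is treated as optional because the template labels it so).

```python
import re

# Allowed values for metadata["type"], taken from the template above.
ALLOWED_TYPES = {"ai-conversations", "archives", "bookmarks", "browser-tabs", "synthesis"}

# Metadata keys treated as required in this sketch (an assumption).
REQUIRED_KEYS = ("id", "type", "version", "schema_version", "created_at",
                 "updated_at", "generator", "provenance", "license")

SEMVER_RE = re.compile(r"^\d+\.\d+\.\d+$")


def validate_output(doc):
    """Return a list of problems found in an output document (empty if none)."""
    metadata = doc.get("metadata")
    if not isinstance(metadata, dict):
        return ["missing 'metadata' object"]
    problems = []
    if "data" not in doc:
        problems.append("missing 'data' object")
    for key in REQUIRED_KEYS:
        if key not in metadata:
            problems.append(f"missing metadata.{key}")
    if "type" in metadata and metadata["type"] not in ALLOWED_TYPES:
        problems.append(f"unknown type: {metadata['type']!r}")
    for key in ("version", "schema_version"):
        value = metadata.get(key)
        if value is not None and not SEMVER_RE.match(str(value)):
            problems.append(f"metadata.{key} is not MAJOR.MINOR.PATCH")
    return problems
```

Returning a list of problems rather than raising on the first one lets a publisher fix everything in a single pass.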
For academic compliance, include Dublin Core elements:
{
"dublin_core": {
"dc:title": "Dataset Title",
"dc:creator": "Author Name",
"dc:subject": ["cognitive archaeology", "AI conversations"],
"dc:description": "Description of the dataset",
"dc:publisher": "Publisher Name",
"dc:contributor": ["Contributor 1"],
"dc:date": "2025-11-19",
"dc:type": "Dataset",
"dc:format": "application/json",
"dc:identifier": "uuid:...",
"dc:source": "https://github.com/...",
"dc:language": "en",
"dc:rights": "CC0-1.0",
"dc:coverage": "Temporal: 2024-01 to 2025-11"
}
}
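Several Dublin Core elements repeat values already present in the required metadata block, so they can be derived rather than typed twice. A rough sketch, assuming a hypothetical helper `to_dublin_core` that is not part of the project; the title, creator, and description have no counterpart in the metadata and must be supplied by the caller.

```python
def to_dublin_core(metadata, title, creator, description, language="en"):
    """Build a partial "dublin_core" block from an output's metadata."""
    return {
        "dublin_core": {
            "dc:title": title,
            "dc:creator": creator,
            "dc:description": description,
            # created_at is ISO 8601, so its first ten characters are the date.
            "dc:date": metadata["created_at"][:10],
            "dc:type": "Dataset",
            "dc:format": "application/json",
            "dc:identifier": metadata["id"],
            "dc:language": language,
            "dc:rights": metadata["license"],
        }
    }
```

Elements such as `dc:subject`, `dc:publisher`, and `dc:coverage` are left out here because nothing in the metadata block determines them.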
For DOI registration via Zenodo/figshare:
{
"datacite": {
"identifier": "10.5281/zenodo.XXXXXXX",
"identifierType": "DOI",
"creators": [
{"name": "Author, Name", "orcid": "0000-0000-0000-0000"}
],
"titles": [
{"title": "Dataset Title"}
],
"publisher": "Zenodo",
"publicationYear": "2025",
"resourceType": "Dataset",
"subjects": ["Cognitive Archaeology", "Digital Humanities"],
"dates": [
{"date": "2025-11-19", "dateType": "Created"}
],
"version": "1.0.0",
"rights": [
{"rights": "CC0 1.0", "rightsURI": "https://creativecommons.org/publicdomain/zero/1.0/"}
]
}
}
All datasets, outputs, and snapshots MUST have UUIDs (RFC 4122):
Format: uuid:xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx
Example: uuid:a1b2c3d4-e5f6-7890-abcd-ef1234567890
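import uuid
from datetime import datetime, timezone


def generate_id():
    """Return a random (version 4) UUID with the required "uuid:" prefix."""
    return f"uuid:{uuid.uuid4()}"


def generate_metadata():
    """Build the minimal metadata fields for a new output."""
    # Timezone-aware UTC timestamp, formatted like 2025-11-19T12:00:00Z.
    # (datetime.utcnow() is deprecated as of Python 3.12.)
    created_at = datetime.now(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")
    return {
        "id": generate_id(),
        "created_at": created_at,
        "schema_version": "1.0.0",
    }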
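Identifiers received from other tools can be checked against the same format. The sketch below is illustrative only; `is_valid_id` is a hypothetical name, and the check covers the `uuid:` prefix and canonical hyphenated form but does not enforce a particular UUID version.

```python
import uuid

PREFIX = "uuid:"


def is_valid_id(value):
    """Check for the "uuid:" prefix followed by a canonical hyphenated UUID."""
    if not isinstance(value, str) or not value.startswith(PREFIX):
        return False
    raw = value[len(PREFIX):]
    try:
        parsed = uuid.UUID(raw)
    except ValueError:
        return False
    # uuid.UUID also accepts braces, missing hyphens and uppercase hex;
    # comparing with the canonical string rejects those looser spellings.
    return str(parsed) == raw
```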
- CAT-DOC-XXX - Documentation files
- CAT-DATA-XXX - Datasets
- CAT-SNAP-XXX - Snapshots
- CAT-REL-XXX - Releases
- GOV-XXX - Governance documents
- ANALYSIS-XXX - Analysis reports

Default for Code: MIT License
Default for Data: CC0 1.0 Universal (Public Domain)
Default for Documentation: CC-BY 4.0
Before publishing, users MUST choose a license:
./scripts/choose_license.sh
# Options:
# 1. CC0 1.0 (Public Domain)
# 2. CC-BY 4.0 (Attribution Required)
# 3. CC-BY-SA 4.0 (Attribution + ShareAlike)
# 4. CC-BY-NC 4.0 (Attribution + Non-Commercial)
# 5. ODbL (Open Database License)
# 6. Custom
# License
This dataset is released under [LICENSE NAME].
**License**: [SPDX Identifier]
**License URL**: [URL]
## Summary
[Human-readable summary of rights]
## Full Text
[Full license text or link]
## Attribution
If you use this dataset, please cite:
[Citation format]
All code files should include:
# SPDX-License-Identifier: MIT
# SPDX-FileCopyrightText: 2025 Cognitive Archaeology Tribunal Contributors
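A short script can confirm that a code file carries these tags. This is a sketch under stated assumptions: `read_spdx_tags` is a hypothetical helper, and scanning only the first ten lines is an arbitrary choice for illustration, not a project rule.

```python
import re

# Matches lines such as "# SPDX-License-Identifier: MIT".
SPDX_TAG_RE = re.compile(r"SPDX-(License-Identifier|FileCopyrightText):\s*(.+?)\s*$")


def read_spdx_tags(source_text, max_lines=10):
    """Collect SPDX tags found in the first max_lines lines of a file."""
    tags = {}
    for line in source_text.splitlines()[:max_lines]:
        match = SPDX_TAG_RE.search(line)
        if match:
            tags[match.group(1)] = match.group(2)
    return tags
```

A file is missing its header when the returned dictionary lacks the `License-Identifier` key.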
Every public dataset MUST include CONSENT.md:
# Consent Manifest
**Dataset ID**: uuid:...
**Dataset Name**: ...
**Publication Date**: 2025-11-19
## Data Sources
### AI Conversations
- [ ] All conversations are my own
- [ ] I have permission to publish shared conversations
- [ ] Sensitive information has been redacted
### Bookmarks
- [ ] All bookmarks are from my personal collection
- [ ] No proprietary/confidential URLs included
- [ ] Shared bookmark collections have permission
### Archives
- [ ] All files are my own or I have rights to publish
- [ ] No copyrighted materials without permission
- [ ] Personal information redacted
## Consent Statement
I, [NAME], affirm that:
1. I have the right to publish this data
2. I have obtained necessary permissions
3. I have redacted sensitive information
4. I release this data under [LICENSE]
**Signed**: [Name]
**Date**: 2025-11-19
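Because the manifest uses Markdown task-list checkboxes, unticked items can be listed automatically before publication. A minimal sketch, assuming a hypothetical helper `unchecked_items`; it reads the manifest text only and makes no judgment about which sections apply to a given dataset.

```python
import re

# A Markdown task-list item: "- [ ] text" (open) or "- [x] text" (ticked).
CHECKBOX_RE = re.compile(r"^\s*-\s\[( |x|X)\]\s+(.*)$")


def unchecked_items(consent_markdown):
    """Return the text of every checklist item that is still unticked."""
    missing = []
    for line in consent_markdown.splitlines():
        match = CHECKBOX_RE.match(line)
        if match and match.group(1) == " ":
            missing.append(match.group(2).strip())
    return missing
```

An empty result means every checkbox in the file is ticked; a section that does not apply to the dataset would need to be removed or ticked by the author for this check to pass.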
Before publishing, verify:
./scripts/sanitize_for_public.sh

Datasets use semantic versioning: MAJOR.MINOR.PATCH
Examples:
- 1.0.0 - Initial release
- 1.1.0 - Added more conversations
- 1.1.1 - Fixed typos in metadata
- 2.0.0 - Changed JSON schema structure

# Tag dataset versions
git tag -a dataset-v1.0.0 -m "Initial public release"
git tag -a dataset-v1.1.0 -m "Added Q4 2024 conversations"
# Push tags
git push --tags
{
"version": "1.1.0",
"version_history": [
{
"version": "1.0.0",
"date": "2025-11-01",
"changes": "Initial release"
},
{
"version": "1.1.0",
"date": "2025-11-19",
"changes": "Added 50 new conversations from Nov 2025"
}
]
}
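Keeping `version` and `version_history` in step is easy to get wrong by hand. The sketch below shows one way to bump a MAJOR.MINOR.PATCH version and append the matching history entry; the function names `bump` and `record_release` are assumptions for illustration and do not refer to existing project code.

```python
def bump(version, part):
    """Return version with the given part ("major", "minor", "patch") bumped."""
    major, minor, patch = (int(piece) for piece in version.split("."))
    if part == "major":
        return f"{major + 1}.0.0"
    if part == "minor":
        return f"{major}.{minor + 1}.0"
    if part == "patch":
        return f"{major}.{minor}.{patch + 1}"
    raise ValueError(f"unknown part: {part!r}")


def record_release(doc, part, date, changes):
    """Bump doc["version"] and append a matching version_history entry."""
    new_version = bump(doc["version"], part)
    doc["version"] = new_version
    doc.setdefault("version_history", []).append(
        {"version": new_version, "date": date, "changes": changes}
    )
    return doc
```

Bumping a higher part resets the lower ones to zero, which is the usual semantic versioning convention.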
@dataset{${DATASET_ID},
author = {${AUTHOR_NAME}},
title = {${DATASET_TITLE}},
year = {${YEAR}},
publisher = {GitHub/Zenodo},
version = {${VERSION}},
doi = {${DOI}},
url = {${URL}}
}
Author, A. A. (Year). Title of dataset (Version X.X) [Data set]. Publisher. https://doi.org/XX.XXXX/XXXXX
Author, First. Year. "Title of Dataset." Version X.X. Publisher. https://doi.org/XX.XXXX/XXXXX.
./scripts/generate_citation.sh --format bibtex
./scripts/generate_citation.sh --format apa
./scripts/generate_citation.sh --format chicago
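The `${NAME}` placeholders in the BibTeX template use the same syntax as Python's `string.Template`, so the template can be filled without custom parsing. This is an illustrative sketch only and says nothing about how `generate_citation.sh` works internally; `render_bibtex` and the field values in the usage example are hypothetical.

```python
from string import Template

# The BibTeX template from this section, with its ${NAME} placeholders intact.
BIBTEX_TEMPLATE = Template("""@dataset{${DATASET_ID},
  author = {${AUTHOR_NAME}},
  title = {${DATASET_TITLE}},
  year = {${YEAR}},
  publisher = {GitHub/Zenodo},
  version = {${VERSION}},
  doi = {${DOI}},
  url = {${URL}}
}""")


def render_bibtex(fields):
    """Fill the template; raises KeyError if a placeholder has no value."""
    return BIBTEX_TEMPLATE.substitute(fields)


if __name__ == "__main__":
    # Placeholder values for demonstration, not a real dataset.
    print(render_bibtex({
        "DATASET_ID": "example2025",
        "AUTHOR_NAME": "Author, Name",
        "DATASET_TITLE": "Dataset Title",
        "YEAR": "2025",
        "VERSION": "1.0.0",
        "DOI": "10.5281/zenodo.XXXXXXX",
        "URL": "https://example.org/dataset",
    }))
```

Using `substitute` rather than `safe_substitute` makes a missing field fail loudly instead of leaving a raw placeholder in the citation.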
For major changes:
rfcs/

All modules MUST have:
./scripts/sanitize_for_public.sh

Recommended:
This governance framework may be amended through:
Version History:
- 1.0.0 (2025-11-19) - Initial governance framework

Issues: https://github.com/ivi374forivi/cognitive-archaelogy-tribunal/issues
Discussions: https://github.com/ivi374forivi/cognitive-archaelogy-tribunal/discussions
Document ID: GOV-FRAMEWORK-001
Status: Active
License: CC-BY-4.0
Last Updated: 2025-11-19