Providing Data
Suggested Storage Locations
Data should be hosted in cloud-accessible object storage, preferably:
- AWS S3 (preferred)
- Example path:
s3://veda-data-store-staging/<instance>/<dataset>/
- Example path:
- Other acceptable options:
- Google Cloud Storage (GCS)
- Azure Blob Storage
- Google Cloud Storage (GCS)
Best Practices
- Ensure data is publicly accessible or that permissions are correctly configured for the VEDA team
- Follow open data standards where applicable
- Provide complete and consistent metadata with your dataset
- Include citation information as part of the submission
Ensuring Open Data Status
Data included in VEDA instances should be openly accessible. Confirm that:
- The dataset has no access restrictions that would prevent public use
- A clear license is provided alongside the dataset (e.g., CC-BY, public domain)
Achieving Data Citation
Provide citation information with your dataset submission, including:
- Dataset title and version
- Authors or originating organization
- DOI or persistent identifier (if available)
- Temporal and spatial coverage
- Reference or landing page (if available)