Contributing¶

Thank you for your interest in contributing to DecentDB!

Ways to Contribute¶

Code Contributions¶

Bug fixes
New features
Performance improvements
Documentation

Non-Code Contributions¶

Bug reports
Feature requests
Documentation improvements
Test cases
Benchmarks
Translations

Getting Started¶

Fork and Clone¶

# Fork the repository on GitHub, then:
git clone https://github.com/YOUR_USERNAME/decentdb.git
cd decentdb

Set Up Development Environment¶

# Install Nim (see Building guide)
# Install dependencies
nimble install

# Build the project
nimble build

# Run tests
nimble test

Development Workflow¶

1. Create a Branch¶

git checkout -b feature/my-feature
# or
git checkout -b fix/bug-description

2. Make Changes¶

Write code
Add tests
Update documentation

3. Test Your Changes¶

# Run all tests
nimble test

# Run specific test
nim c -r tests/nim/test_your_feature.nim

# Build with strict checks
nimble build

4. Commit¶

git add .
git commit -m "type: description"

Commit message format: - feat: New feature - fix: Bug fix - docs: Documentation - test: Tests - perf: Performance - refactor: Code refactoring

Examples:

feat: Add IN operator support
fix: Handle NULL in aggregate functions
docs: Update SQL reference for JOINs
test: Add crash test for torn writes

5. Push and Create Pull Request¶

git push origin feature/my-feature

Then open a PR on GitHub with: - Clear description - What changed and why - Test results - Related issues

Code Guidelines¶

Nim Style¶

Follow NEP-1:

# Good
proc calculateTotal(items: seq[Item]): int64 =
  result = 0
  for item in items:
    result += item.price

# Bad
proc calculate_total(items:seq[Item]):int64=
result=0
for i in items:result+=i.price

Key points: - PascalCase for types - camelCase for variables, procs - 2 spaces indentation - Max 80-100 chars per line

Error Handling¶

Always check results:

let res = someOperation()
if not res.ok:
  return err[Type](res.err.code, res.err.message)

Never ignore errors in production code.

Testing¶

Every change needs tests:

suite "My Feature":
  test "basic functionality":
    let res = myFeature()
    check res.ok
    check res.value == expected

  test "error handling":
    let res = myFeature(invalid_input)
    check not res.ok
    check res.err.code == ERR_SQL

Documentation¶

Document public APIs:

## Calculate page utilization percentage
## 
## Returns 0.0-100.0 representing percentage of page space used
proc calculatePageUtilization*(tree: BTree, pageId: PageId): Result[float]

Areas Needing Help¶

High Priority¶

- [ ] Cost-based query optimizer¶

The planner currently lacks a true cost model (e.g., cardinality/selectivity estimates, costed join ordering, and costed access-path selection). High priority because it is the main unlock for closing the performance gap on multi-join and selective-filter queries without relying on manual query rewrites. - [ ] Additional SQL functions - The SQL surface area is still missing many “everyday” built-in functions (string, math, date/time, and common utility functions) that users expect for real workloads and ORMs. High priority because it reduces friction immediately and tends to be incremental work with clear, testable semantics. - [ ] Performance benchmarks - The project needs a tighter, reproducible benchmark suite (micro + macro) with stable datasets, clear baselines, and automation to catch regressions (ideally runnable in CI or in a repeatable local harness). High priority because performance work is hard to prioritize (and easy to regress) without consistent numbers and a shared methodology. - [ ] More crash test scenarios - Crash-injection coverage should expand to include more “unlucky timing” cases around WAL writing, fsync boundaries, checkpointing, and recovery paths (including partial writes/torn pages and interrupted truncation). High priority because durability/correctness claims depend on surviving these edge cases, and tests are the fastest way to prevent regressions.

Medium Priority¶

- [ ] Additional VFS implementations¶

Today the default VFS is OS-backed; additional VFS backends (e.g., in-memory for tests/benchmarks, or a constrained/portable VFS for embedded targets) would broaden where DecentDB can run and improve testability. Medium priority because it’s valuable but tends to be platform- and integration-heavy. - [ ] Query caching improvements - There’s limited reuse of work across repeated queries (e.g., prepared/bound statement reuse, plan caching, and safe result/page-level caching under snapshot isolation). Medium priority because it can yield big wins for repeated workloads, but needs careful invalidation semantics to preserve correctness. - [ ] Compression support - Storage currently writes raw pages/records/WAL frames; compression would reduce IO and file size (especially for text-heavy datasets) but requires clear choices about where compression lives (page vs record vs WAL), thresholds, and CPU tradeoffs. Medium priority because it’s impactful, but easy to get wrong without careful benchmarking and format considerations. - [ ] Better error messages - Many errors are technically correct but could be more actionable (more context like operation/object name, friendlier phrasing, and consistent codes), especially for SQL parse/bind/exec failures. Medium priority because it improves usability and debuggability, but doesn’t usually unblock core correctness/performance work.

Documentation¶

[ ] More code examples
[ ] Tutorial videos
[ ] Language bindings (Python, etc.)
[ ] Performance tuning guides

Submitting Issues¶

Bug Reports¶

Include: - DecentDB version - Operating system - Steps to reproduce - Expected vs actual behavior - Error messages - Minimal test case

Example:

**Version:** 1.7.4
**OS:** Ubuntu 22.04

**Steps:**
1. CREATE TABLE t (id INT)
2. INSERT INTO t VALUES (1)
3. SELECT * FROM t WHERE id = 2

**Expected:** Empty result
**Actual:** Returns row with id=1

**Error:** None

Feature Requests¶

Include: - Use case description - Proposed solution - Alternatives considered - Additional context

Pull Request Process¶

Ensure tests pass
```
nimble test
```
Update documentation
Add to relevant .md files
Update CHANGELOG.md
Add code comments
Follow commit conventions
Clear, descriptive messages
Reference issues: "Fixes #123"
Request review
Tag maintainers
Respond to feedback
Make requested changes
Wait for CI
All checks must pass
Reviewer approval required

Code Review¶

What We Look For¶

Correctness: Does it work? Are edge cases handled?
Tests: Are there tests? Do they cover edge cases?
Style: Does it follow Nim conventions?
Documentation: Is it documented?
Performance: Any obvious issues?

Review Process¶

Automated checks (CI)
Maintainer review
Address feedback
Final approval
Merge

Development Environment¶

Recommended Setup¶

VS Code with Nim extension
Or Vim/Neovim with nimlsp
Git with GPG signing
GitHub CLI (optional)

Useful Commands¶

# Format code
nimpretty src/*.nim

# Check for issues
nim check src/decentdb.nim

# Profile build
nim c -d:release --profiler:on src/decentdb.nim

# Memory check
valgrind --leak-check=full ./decentdb ...

Community¶

Communication¶

GitHub Issues: Bug reports, features
GitHub Discussions: Questions, ideas
Pull Requests: Code contributions

Code of Conduct¶

Be respectful
Welcome newcomers
Focus on constructive feedback
Assume good intent

Recognition¶

Contributors will be: - Listed in CONTRIBUTORS.md - Mentioned in release notes - Credited in relevant documentation

Questions?¶

Check existing issues
Read the documentation
Open a discussion
Ask in a PR comment

Thank you for contributing to DecentDB!