# Page Not Found

The URL `avoiding_shuffle_less_stage-_more_fast` does not exist. This page may have been moved, renamed, or deleted.

## Suggested Pages

You may be looking for one of the following:
- [Avoiding Shuffle "Less stage, run faster"](https://umbertogriffo.gitbook.io/apache-spark-best-practices-and-tuning/rdd/avoiding_shuffle_less_stage-_more_fast.md)
- [Don’t collect large RDDs](https://umbertogriffo.gitbook.io/apache-spark-best-practices-and-tuning/rdd/dont_collect_large_rdds.md)
- [Use the Best Data Format](https://umbertogriffo.gitbook.io/apache-spark-best-practices-and-tuning/storage/use-the-best-data-format.md)
- [Joining a large and a small Dataset](https://umbertogriffo.gitbook.io/apache-spark-best-practices-and-tuning/dataframe/joining-a-large-and-a-medium-size-dataset.md)
- [How to estimate the size of a Dataset](https://umbertogriffo.gitbook.io/apache-spark-best-practices-and-tuning/parallelism/sparksqlshufflepartitions_draft.md)

## How to find the correct page

If the exact page cannot be found, you can still retrieve the information using the documentation query interface.

### Option 1 — Ask a question (recommended)

Send an HTTP GET request to the documentation URL with the `ask` query parameter, percent-encoding the question:

```
GET https://umbertogriffo.gitbook.io/apache-spark-best-practices-and-tuning/rdd/avoiding_shuffle_less_stage-_more_fast.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.
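As a minimal sketch of the request described above, the following Python builds the `ask` URL with the standard library and percent-encodes the question. The helper names `build_ask_url` and `ask` are hypothetical, and decoding the response as UTF-8 is an assumption, since the response format is not specified here beyond containing an answer, excerpts, and sources.

```python
from urllib.parse import quote, urlencode
from urllib.request import urlopen

# Page URL taken from the request example above.
PAGE_URL = (
    "https://umbertogriffo.gitbook.io/apache-spark-best-practices-and-tuning"
    "/rdd/avoiding_shuffle_less_stage-_more_fast.md"
)


def build_ask_url(question: str, page_url: str = PAGE_URL) -> str:
    """Return page_url with the question percent-encoded in the `ask` parameter."""
    # quote_via=quote encodes spaces as %20 rather than '+'.
    return f"{page_url}?{urlencode({'ask': question}, quote_via=quote)}"


def ask(question: str, timeout: float = 30.0) -> str:
    """Send the GET request and return the body as text (requires network access).

    Assumption: the response body is UTF-8 text.
    """
    with urlopen(build_ask_url(question), timeout=timeout) as response:
        return response.read().decode("utf-8")


# Building the URL does not touch the network; call ask(...) to send the request.
url = build_ask_url("How can I avoid a shuffle when joining two RDDs?")
print(url)
```

Keeping URL construction separate from the network call makes it easy to inspect the exact request before sending it.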

### Option 2 — Browse the documentation index

Full index: https://umbertogriffo.gitbook.io/apache-spark-best-practices-and-tuning/sitemap.md

Use this to discover valid page paths or navigate the documentation structure.

### Option 3 — Retrieve the full documentation corpus

Full export: https://umbertogriffo.gitbook.io/apache-spark-best-practices-and-tuning/llms-full.txt

Use this to access all content at once and perform your own parsing or retrieval. Because it downloads the entire documentation, it is more expensive than the other two options.

## Tips for requesting documentation

Prefer `.md` URLs for structured content: append `.md` to the page URL (e.g., `/apache-spark-best-practices-and-tuning/rdd/avoiding_shuffle_less_stage-_more_fast.md`).

You may also send an `Accept: text/markdown` header for content negotiation.
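The two tips above might be combined as in the sketch below, which uses only the Python standard library. The helper names `to_markdown_url`, `markdown_request`, and `fetch_markdown` are hypothetical, the example URL is one of the suggested pages with its `.md` suffix removed for illustration, and the UTF-8 decoding is an assumption.

```python
from urllib.request import Request, urlopen


def to_markdown_url(url: str) -> str:
    """Append `.md` to a page URL unless it is already present.

    Assumption: the URL has no query string, fragment, or trailing slash.
    """
    return url if url.endswith(".md") else url + ".md"


def markdown_request(url: str) -> Request:
    """Build a GET request for the `.md` URL that also asks for Markdown via `Accept`."""
    return Request(to_markdown_url(url), headers={"Accept": "text/markdown"})


def fetch_markdown(url: str, timeout: float = 30.0) -> str:
    """Fetch the page and return its body as text (requires network access).

    Assumption: the response body is UTF-8 text.
    """
    with urlopen(markdown_request(url), timeout=timeout) as response:
        return response.read().decode("utf-8")


# Constructing the request does not touch the network.
request = markdown_request(
    "https://umbertogriffo.gitbook.io/apache-spark-best-practices-and-tuning"
    "/rdd/dont_collect_large_rdds"
)
print(request.full_url, request.get_header("Accept"))
```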