KNOWLEDGE BASE

How to reload all data to show the data distributions of all data in the preview in Tableau Prep Builder without reopening the flow


Published: 28 Jan 2023
Last Modified Date: 30 Jan 2023

Question

How can the input data be reloaded to show the data distributions of all data in the preview in Tableau Prep Builder without reopening the flow?

When opening a flow that has more than 1 million rows (5.73 million rows in this example) and whose Input Step is set to [Use all data] in the [Data Sample] pane, the preview of the clean step initially shows the data distributions of all data.

After you start editing the flow, the data distributions in the preview of the clean step are limited to 1 million rows.


However, sometimes we want to review the data distributions of all data in the preview of the clean step to confirm that the flow has been edited as expected.
Is there any way to review the data distributions of all data in the preview of the clean step again without reopening the flow?

Environment

  • Tableau Prep Builder

Answer

There is a workaround to reload all data without reopening the flow.
Note: Save your flow before trying the workaround, because using [Use all data] can impact performance or cause Tableau Prep Builder to time out.
=====
1. In Tableau Prep Builder, in the Input Step, confirm that the current [Data Sample] setting is [Use all data].

2. In the same Input Step, change the [Data Sample] setting to [Fixed number of rows], and leave the row limit value blank.

3. In the same Input Step, change [Sampling method] from the current option to a different one.
In this example, change it from [Random sample] to [Quick select].


4. Confirm that the [Data Sample] setting automatically returns to [Use all data].
Note: By design, changing [Sampling method] while the row limit value is blank makes the setting invalid, so [Data Sample] reverts to its previous setting.


5. Return to the clean step where you want to review the data distributions of all data, then wait until the data finishes loading.

6. Confirm that the data distributions show the full data size again.

Additional Information

If you only need to review data distributions that closely approximate those of all data, you can instead set [Data Sample] to [Fixed number of rows] with a row limit value of 1 million, then select [Random sample] as the [Sampling method].

Here is a comparison of the data distributions of the limited data ([Fixed number of rows]: 1,000,000 with [Random sample]) and those of all data.
[Image: data distributions of the limited data ([Fixed number of rows]: 1,000,000 with [Random sample])]

[Image: data distributions of all data]
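
The reasoning behind this comparison can be sketched outside Tableau Prep Builder. The Python example below is only an illustration under stated assumptions: the data is synthetic, the row counts are scaled down to about one tenth (573,000 rows, sample of 100,000) so that it runs quickly, and the function names are hypothetical. It compares the category shares of the full data with those of a uniform random sample and with those of the first rows of a sorted input. The "first rows" case is only a stand-in for a non-random selection and is not a claim about how [Quick select] chooses rows.

```python
import random
from collections import Counter


def distribution(values):
    """Return each category's share of the rows as a fraction of the total."""
    counts = Counter(values)
    total = len(values)
    return {key: count / total for key, count in counts.items()}


def max_share_gap(dist_a, dist_b):
    """Return the largest absolute difference in share across all categories."""
    keys = set(dist_a) | set(dist_b)
    return max(abs(dist_a.get(k, 0.0) - dist_b.get(k, 0.0)) for k in keys)


rng = random.Random(0)   # fixed seed so the run is repeatable
TOTAL_ROWS = 573_000     # assumption: scaled-down stand-in for 5.73 million rows
SAMPLE_ROWS = 100_000    # assumption: scaled-down stand-in for the 1 million row limit

# Synthetic, hypothetical data: four categories with uneven weights, sorted so
# that the first rows of the input are not representative of the whole.
rows = sorted(
    rng.choices(["A", "B", "C", "D"], weights=[5, 3, 1.5, 0.5], k=TOTAL_ROWS)
)

full = distribution(rows)
random_sample = distribution(rng.sample(rows, SAMPLE_ROWS))
first_rows = distribution(rows[:SAMPLE_ROWS])

random_gap = max_share_gap(full, random_sample)
first_rows_gap = max_share_gap(full, first_rows)

print(f"Largest share gap, random sample vs all data: {random_gap:.4f}")
print(f"Largest share gap, first rows vs all data:    {first_rows_gap:.4f}")
```

In this sketch the random sample stays close to the shares of the full data, while the first rows of a sorted input do not. Rare values can still be under-represented in any sample, so use the [Use all data] workaround when exact distributions matter.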