Add support for previewing parquet files
acknowledged
Indigo Wildcat
Preview is currently supported for the following file types:
- Nextflow output files (.command.*, .fusion.* and .exitcode)
- Text
- CSV and TSV
- HTML
- Images (JPG, PNG, SVG, etc.)
Parquet is a very common columnar storage format for datasets, and preview for this file type should be supported.
Mysterious Mongoose
Hi, I suggest using DuckDB-Wasm to preview big Parquet files (this won't blow up the browser memory, and it gives nice SQL-like exploration).
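To illustrate the kind of SQL-like exploration suggested above, here is a minimal sketch. It assumes the native `duckdb` command-line client is installed and uses it in place of the Wasm build; the same query should be usable from DuckDB-Wasm in the browser. `preview_sql` is a hypothetical helper name, and the default of 20 rows is an arbitrary choice.

```bash
#!/bin/bash
# Sketch: print only the first rows of a Parquet file with the DuckDB CLI.
# Assumes the `duckdb` binary is on PATH; preview_sql is a hypothetical helper.
set -euo pipefail

# Build a query that returns at most N rows from a Parquet file.
preview_sql() {
  local file="$1" rows="${2:-20}"
  local q="'"
  # double any single quotes so the path is a valid SQL string literal
  local escaped="${file//$q/$q$q}"
  printf "SELECT * FROM read_parquet('%s') LIMIT %d;" "$escaped" "$rows"
}

# Run the query only when a file argument is given.
if [[ $# -ge 1 ]]; then
  # -csv prints the result as CSV, -c runs one command and exits
  duckdb -csv -c "$(preview_sql "$1" "${2:-20}")"
fi
```

Saved under a name of your choosing, it would be called with a Parquet path and an optional row count. The CSV it prints could also be saved to a file and viewed with the existing CSV preview.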
Rob Newman marked this post as acknowledged
Charcoal Mandrill
I had this issue recently too, and here is a trick I used to get around it: wrap your file in HTML, save a copy as .html, and then register it in the tower.yml.
You can use a script like this:
#!/bin/bash
# wrap an input JSON file with HTML so it loads nicely in web browsers
set -euxo pipefail
INPUT_JSON="${1}"
OUTPUT_HTML="${2}"
cat << EOF > "$OUTPUT_HTML"
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="UTF-8">
<title>AWS Details Page</title>
<style>
#json-container {
width: 100%;
height: 80vh; /* adjust height as needed */
padding: 10px;
border: 1px solid #ccc;
background: #f9f9f9;
font-family: monospace;
white-space: pre; /* respect line breaks */
overflow: auto; /* enable scrolling */
box-sizing: border-box;
}
</style>
</head>
<body>
<div id="json-container">
EOF
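# escape HTML special characters so markup in the input is shown as text
sed -e 's/&/\&amp;/g' -e 's/</\&lt;/g' -e 's/>/\&gt;/g' "$INPUT_JSON" >> "$OUTPUT_HTML"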
cat << EOF >> "$OUTPUT_HTML"
</div>
</body>
</html>
EOF
The script here references JSON, but any text-based file will work.
Try not to blow up the browser on huge files.
Hopefully Seqera Platform can come up with a smarter way to do this natively :)
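For the "register it in the tower.yml" step, a rough sketch of what the entry could look like is below. It assumes the usual `reports:` layout of tower.yml, with a file pattern as the key and a `display` label under it. `report_entry` is a hypothetical helper, and the pattern and label are placeholders to adapt to your pipeline.

```bash
#!/bin/bash
# Sketch: print a tower.yml "reports" entry for the wrapped HTML file.
# report_entry is a hypothetical helper; pattern and label are placeholders.
set -euo pipefail

report_entry() {
  local pattern="$1" display="$2"
  # the pattern is quoted so that globs such as *.html stay valid YAML
  printf 'reports:\n  "%s":\n    display: "%s"\n' "$pattern" "$display"
}

# Print an entry only when both a pattern and a label are given.
if [[ $# -ge 2 ]]; then
  report_entry "$1" "$2"
fi
```

Called with a file pattern and a label, it prints a fragment to stdout that can be pasted into the pipeline's tower.yml. If the file already has a `reports:` key, only the indented lines under it are needed.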
Rob Newman Rob Syme