-
201
A multi-hop example from the BibSQL dataset.
Published 2025 "…We further refined SQL generation using a Program-of-Thoughts (PoT) prompting strategy, which guides the LLM to produce more accurate output by first creating Python pseudocode. …"
-
202
Detailed statistics of the BibSQL dataset.
Published 2025 "…We further refined SQL generation using a Program-of-Thoughts (PoT) prompting strategy, which guides the LLM to produce more accurate output by first creating Python pseudocode. …"
-
203
Tasiyagnunpa Migration 2023
Published 2025 "…This data snippet contains: (1) World Wildlife Fund (WWF) ecoregions; (2) occurrence data from the Global Biodiversity Information Facility (GBIF) on Tasiyagnunpa (Lakota name for the western meadowlark) in 2023, accessed with the pygbif library. Data used in the ESIIL Stars program migration coding challenge and published online on the ESIIL Environmental Data Science Learning Portal.…"
-
204
Veery Migration 2023
Published 2025 "…This data snippet contains: (1) World Wildlife Fund (WWF) ecoregions for the Veery; (2) occurrence data from the Global Biodiversity Information Facility (GBIF) on Veery thrushes (Catharus fuscescens) in 2023, accessed with the pygbif library. Data used in the University of Colorado Boulder Earth Data Analytics -- Foundations graduate certificate program migration coding challenge and published online on the ESIIL Environmental Data Science Learning Portal.…"
-
205
RNACOREX package workflow.
Published 2025 "…In this work, we introduce a new Python package called RNACOREX (RNA CORegulatory network EXplorer and classifier). …"
-
206
Performance.
Published 2025 "…In this work, we introduce a new Python package called RNACOREX (RNA CORegulatory network EXplorer and classifier). …"
-
207
LAURA: Enhancing Code Review Generation with Context-Enriched Retrieval-Augmented LLM
Published 2025 "…The dataset section contains 301k entries from 1,807 high-quality projects sourced from GitHub, covering four programming languages: C, C++, Java, and Python. We also provide the time-split dataset used as the retrieval database (which is also used for fine-tuning CodeReviewer) and the human-annotated evaluation dataset.…"
-
208
Data from: Dairy cows inoculated with highly pathogenic avian influenza virus H5N1
Published 2024 "…The pipeline uses Python v3.10, R v4.4 (R Development Core Team 2024), and SnakeMake to organize programs and script execution. …"
-
209
AGU24 - EP11D-1300 - Revisiting Megacusp Embayment Occurrence in Monterey Bay and Beyond: High Spatiotemporal Resolution Satellite Imagery Provides New Insight into the Wave Condit...
Published 2025 "…D., Vos, K., & Splinter, K. D. (2022). A Python toolkit to monitor sandy shoreline change using high-resolution PlanetScope cubesats. …"
-
210
Methodological Approach Based on Structural Parameters, Vibrational Frequencies, and MMFF94 Bond Charge Increments for Platinum-Based Compounds
Published 2025 "…The developed bci optimization tool, based on MMFF94, was implemented using a Python code made available at https://github.com/molmodcs/bci_solver. …"
-
211
Building footprints from 1970s Hexagon spy satellite images for four global urban growth hotspots
Published 2025 "…Processing environment: This research has been conducted using Python for ESRI ArcGIS Pro version 3.2.1 and the TensorFlow package. …"
-
212
GeoGraphNetworks: Shapefile-Derived Datasets for Accurate and Scalable Graphical Representations
Published 2025 "…The JSON files contain graph objects created using the widely used Python library NetworkX, allowing for immediate use without the need for pre-processing. …"
-
213
Behavioural machine activity for benign and malicious Win7 64-bit executables
Published 2024 "…Dataset 2: filename = "data_2.csv"; 2345 benign samples; 2286 malicious samples; up to 20 seconds execution per file. The data was collected in a VirtualBox[1] virtual machine using Cuckoo Sandbox[2] with a custom package written with the Python library psutil[4] to collect the machine activity data. …"
-
214
Analysis of Sensing Modalities for Electrode-Induction Gas Atomization of Metal Powders: Vibrometer and Acoustic Emission Data (TDMS)
Published 2025 "…Both sensors were recorded with a PXI system and a custom LabVIEW program. The files can be read either in Python with the nptdms package (https://nptdms.readthedocs.io/en/stable/) or with Matlab.…"
-
215
PepENS
Published 2025 "…To run the tool, execute the script PepENS_user.py (located in the Dataset1 directory). Before using the tool, users must generate three types of features: the PSSM (see the PSI-BLAST tutorial, https://www.ncbi.nlm.nih.gov/books/NBK2590/), the Transformer embeddings (see the ProtTrans usage guide, https://github.com/agemagician/ProtTrans), and the HSE features (via the hsexpo program, https://github.com/biopython/biopython/blob/master/Scripts/PDB/hsexpo.py, with Exposure Types CN, HSEBD, and HSEBU). …"
-
216
USDA-ARS Tucson, Arizona 2014-2022 Data Reservoir of Field Experiments with Managed Honey Bee Colonies: Annotated Hive Frame Photos: Dataset I
Published 2025 "…SRC/ also contains the following Python scripts that we used for training YOLO networks: (a) train_valid_split.py -- splits all alldata.txt in USDA_ARZ_DATA_YOLO_19june2025.zip into train.txt and valid.txt for YOLO training.…"
-
217
-
218
Automatically Generated Chemical KG
Published 2025 "…The files are in JSON format and are intended to be loaded within Python as dictionaries. The Full_SSKG.json file is approximately 11 GB in size when extracted. …"
-
219
100-m resolution Age-Stratified Population Estimation from the 2020 China Census by Township (ASPECT)
Published 2024 "…Once uncompressed, the GeoTIFF files can be processed by GIS software such as ArcGIS, and by programming language packages such as Rasterio in Python.…"
-
220
Visium spatial transcriptomics data for individual oral squamous cell carcinoma (OSCC) patients
Published 2025 "…Supplementary spatial metadata components (pickled Python objects) for use with the integrated AnnData object are available at Figshare (https://doi.org/10.6084/m9.figshare.20408067).…"