-
201
Performance.
Published 2025“…In this work, we introduce a new Python package called RNACOREX (RNA CORegulatory network EXplorer and classifier). …”
-
202
Building footprtints from 1970s Hexagon spy satellite images for four global urban growth hotspots
Published 2025“…</p> <p><strong>Processing environment</strong></p> <p>This research has been conducted using Python for ESRI ArcGIS Pro version 3.2.1 and the TensorFlow package. …”
-
203
DevCMG: Developer-Centric Automated Commit Message Generation
Published 2025“…<br><br>---<br><br>## **Datasets**<br>The `dataset` folder contains all 2,683 commits used in this study, covering the following five programming languages:<br>- C++<br>- C#<br>- Java<br>- JavaScript<br>- Python<br><br>---<br><br>## **Baselines**<br>### **State-of-the-Art (SOTA):**<br>- KADEL<br>- OMEGA<br>- DeepSeek-V3<br><br>### **Other Tools:**<br>- GPT-3.5<br>- CmtGen<br>- CoRec<br>- NMT<br>- NNGen<br>- Ptr-net<br><br>---<br><br>## **Experiments**<br><br>### **RQ1: Effectiveness of DevCMG**<br>Commands to run the experiments:<br>python message_generation.py<br>python llm_judge_metrics.py<br><br>## **RQ2: Ablation Study Results**<br><br>| **Approach** | **Reasonableness** | **Comprehensiveness** | **Succinctness** | **Normativity** | **Weighted Average** |<br>|-------------------------------|--------------------|-----------------------|------------------|-----------------|-----------------------|<br>| Baseline | 3.36 | 2.99 | 3.32 | 2.18 | 2.9445 |<br>| Without behavior clustering | 3.39 | 3.37 | 2.61 | 2.22 | 2.919 |<br>| Without CCS classification | 3.59 | 3.18 | 3.33 | 3.20 | 3.3725 |<br>| **Our approach** | **3.99** | **3.91** | **3.65** | **3.87** | **3.891** |<br><br>---<br><br># **RQ3: Rankings from Different Evaluators**<br><br>| **Evaluator** | **Ranking** |<br>|-------------------|------------------------------------------------------------------------------------------------------------|<br>| **Gemini-2.5** | DevCMG, Zero-Shot, GPT-3.5, OMEGA, KADEL, NNgen, Ptr-net, CmtGen, CoRec, NMT |<br>| **GPT-4o** | DevCMG, Zero-Shot, OMEGA, GPT-3.5, NNgen, KADEL, Ptr-net, CoRec, CmtGen, NMT |<br>| **DeepSeek-R1** | DevCMG, Zero-Shot, OMEGA, GPT-3.5, KADEL, NNgen, Ptr-net, CoRec, CmtGen, NMT |<br>| **Qwen-3** | DevCMG, Zero-Shot, GPT-3.5, OMEGA, KADEL, NNgen, Ptr-net, CoRec, NMT, CmtGen |<br>| **ChatGLM-4** | DevCMG, OMEGA, Zero-Shot, GPT-3.5, NNgen, KADEL, Ptr-net, CoRec, CmtGen, NMT |<br>| **Human** | DevCMG, Zero-Shot, OMEGA, GPT-3.5, NNgen, KADEL, Ptr-net, CoRec, CmtGen, NMT |<br><br>## **RQ4: User Study Results**<br><br>The user study results are available in the `/experiments/RQ4` folder. …”
-
204
AGU24 - EP11D-1300 - Revisiting Megacusp Embayment Occurrence in Monterey Bay and Beyond: High Spatiotemporal Resolution Satellite Imagery Provides New Insight into the Wave Condit...
Published 2025“…D., Vos, K., & Splinter, K. D. (2022). A Python toolkit to monitor sandy shoreline change using high-resolution PlanetScope cubesats. …”
-
205
Data from: Dairy cows inoculated with highly pathogenic avian influenza virus H5N1
Published 2024“…The pipeline uses Python v3.10, R v4.4 (R Development Core Team 2024), and SnakeMake to organize programs and script execution. …”
-
206
Methodological Approach Based on Structural Parameters, Vibrational Frequencies, and MMFF94 Bond Charge Increments for Platinum-Based Compounds
Published 2025“…The developed bci optimization tool, based on MMFF94, was implemented using a Python code made available at https://github.com/molmodcs/bci_solver. …”
-
207
The perceived wealth and physical disorder scores prediction dataset for urban China
Published 2025“…They can be processed using GIS software such as ArcGIS and QGIS, as well as Python programming language packages such as Rasterio. …”
-
208
Analysis of Sensing Modalities for Electrode-Induction Gas Atomization of Metal Powders: Vibrometer and Acoustic Emission Data (TDMS)
Published 2025“…Both sensors were recorded with a PXI system and a custom LABVIEW program.</p><p dir="ltr">The files can be read either in Python with the <a href="https://nptdms.readthedocs.io/en/stable/" rel="noreferrer" target="_blank">nptdms </a>or just with Matlab.…”
-
209
HISTORECO: Historical Spanish transition database on climate, geography, and economics of the 20th-21st Century
Published 2025“…</p><p dir="ltr">The dataset combines information from twenty sources (databases/articles), harmonizing and downscaling them to the municipal level using GIS and programming tools (mainly QGIS, R, and Python). …”
-
210
Data and code for: A century of reforestation reduced anthropogenic warming in the eastern United States
Published 2025“…</p><p>All the data can be processed using open-source programs such as R or Python. </p>…”
-
211
Global Aridity Index and Potential Evapotranspiration (ET0) Database: Version 3.1
Published 2025“…</p><p dir="ltr">The Python programming source code used to run the calculation of ET0 and AI is provided and available online on Figshare at:</p><p dir="ltr">https://figshare.com/articles/software/Global_Aridity_Index_and_Potential_Evapotranspiration_Climate_Database_v3_-_Algorithm_Code_Python_/20005589</p><p dir="ltr">Peer-Review Reference and Proper Citation:</p><p dir="ltr">Zomer, R.J.; Xu, J.; Trabuco, A. 2022. …”
-
212
ChatbotDjango
Published 2025“…</li></ul><h3>Technologies Commonly Used:</h3><ul><li>Programming languages: Python, JavaScript</li><li>NLP libraries/frameworks: spaCy, NLTK, Rasa, Dialogflow, BERT, GPT</li><li>Integration platforms: Telegram Bot API, Facebook Messenger, Web-based chat UI</li></ul><p></p>…”
-
213
PepENS
Published 2025“…To run the tool, execute the script PepENS_user.py (located in the Dataset1 directory). Before using the tool, users must generate three types of features: the PSSM (see <a href="https://www.ncbi.nlm.nih.gov/books/NBK2590/" rel="nofollow" target="_blank">PSI-BLAST tutorial</a>), the Transformer embeddings (see <a href="https://github.com/agemagician/ProtTrans" target="_blank">ProtTrans usage guide</a>), and the HSE features (via <a href="https://github.com/biopython/biopython/blob/master/Scripts/PDB/hsexpo.py" target="_blank">hsexpo program</a> with Exposure Types CN, HSEBD, and HSEBU). …”
-
214
LAURA: Enhancing Code Review Generation with Context-Enriched Retrieval-Augmented LLM
Published 2025“…The dataset section contains 301k entries from 1,807 high-quality projects sourced from GitHub, covering four programming languages: C, C++, Java, and Python. We also provide the time-split dataset used as the retrieval database (which is also used for fine-tuning CodeReviewer) and the human-annotated evaluation dataset.…”
-
215
MSc Personalised Medicine at Ulster University
Published 2025“…Both full-time and part-time programmes have two intakes and can be started in September or January.…”
-
216
Automatically Generated Chemical KG
Published 2025“…The files are in JSON format and are intended to be loaded within Python as dictionaries. The <i>Full_SSKG.json </i>file is approximately 11GB in size when extracted. …”
-
217
-
218
GeoGraphNetworks: Shapefile-Derived Datasets for Accurate and Scalable Graphical Representations
Published 2025“…</p><p dir="ltr">The JSON files contain graph objects created using the widely used Python library NetworkX, allowing for immediate use without the need for pre-processing. …”
-
219
USDA-ARS Tucson, Arizona 2014-2022 Data Reservoir of Field Experiments with Managed Honey Bee Colonies: Annotated Hive Frame Photos: Dataset I
Published 2025“…</p><p dir="ltr">SRC/ also contains the following Python scripts that we used for training YOLO networks:</p><p dir="ltr">(a) train_valid_split.py -- splits all alldata.txt in USDA_ARZ_DATA_YOLO_19june2025.zip into train.txt and valid.txt for YOLO training.…”
-
220
Behavioural machine activity for benign and malicious Win7 64-bit executables
Published 2024“…</li></ul><p><br></p><p><strong>Dataset 2:</strong></p><ul><li>filename = "data_2.csv"</li><li>2345 benign samples </li><li>2286 malicious samples</li><li>Up to 20 seconds execution per file</li><li>The data was collected in a VirtualBox[1] virtual machine using Cuckoo Sandbox[2] with a custom package written in the python library, Psutil[4] to collect the machine activity data. …”