BitcoinTemporalGraph
<p dir="ltr">This dataset contains several files:</p><ul><li><b>dataset.tar.gz</b>: A compressed PostgreSQL database representing a graph.</li><li><b>addresses.csv</b>: A list of approximately 100,000 labeled Bitcoin addresses.<...
Saved in:
| Main Author: | |
|---|---|
| Other Authors: | |
| Published: |
2025
|
| Subjects: | |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | <p dir="ltr">This dataset contains several files:</p><ul><li><b>dataset.tar.gz</b>: A compressed PostgreSQL database representing a graph.</li><li><b>addresses.csv</b>: A list of approximately 100,000 labeled Bitcoin addresses.</li></ul><h2>BitcoinTemporalGraph (dataset.tar.gz)</h2><p dir="ltr">This dataset represents a graph of value transfers between Bitcoin users. The nodes represent entities/users, and the edges represent value transfers or transactions between these entities. The graph is temporal and directed.</p><h3>Usage:</h3><ol><li>Decompress the archive: "pigz -p 10 -dc dataset.tar.gz | tar -xvf -"</li><li>Restore the tables into an existing PostgreSQL database using the <code>pg_restore</code> utility: "pg_restore -j number_jobs -Fd -O -U database_username -d database_name dataset"</li><li>Ensure substantial storage for the database: 40GB for <code>node_features</code> and 80GB for <code>transaction_edges</code> (including indexes)</li></ol><h3>Dataset Description</h3><p dir="ltr">The database contains two tables: <code>node_features</code> (approximately 252 million rows) and <code>transaction_edges</code> (approximately 785 million rows).</p><h5>Columns for <code>node_features</code> table:</h5><ul><li><b>alias</b>: Identifier of the node</li><li><b>degree</b>: Degree of the node</li><li><b>degree_in</b>: Number of incoming edges to the node</li><li><b>degree_out</b>: Number of outgoing edges from the node</li><li><b>total_transaction_in</b>: Total count of value transfers received by the node</li><li><b>total_transaction_out</b>: Total count of value transfers initiated by the node</li></ul><p dir="ltr">Amounts are expressed in satoshis (1 satoshi = 10^-8 Bitcoin):</p><ul><li><b>min_sent</b>: Minimum amount sent by the node during a transaction</li><li><b>max_sent</b>: Maximum amount sent by the node during a transaction</li><li><b>total_sent</b>: Total amount sent by the node during all transactions</li><li><b>min_received</b>: Minimum amount received by the node during a transaction</li><li><b>max_received</b>: Maximum amount received by the node during a transaction</li><li><b>total_received</b>: Total amount received by the node during all transactions</li><li><b>label</b>: Label describing the type of entity represented by the node</li></ul><p dir="ltr">Transactions on the Bitcoin network are stored in the public ledger named the "Bitcoin Blockchain". Each transaction is recorded in a block, with the block index indicating the transaction's position in the blockchain.</p><ul><li><b>first_transaction_in</b>: Block index of the first transaction received by the node</li><li><b>last_transaction_in</b>: Block index of the last transaction received by the node</li><li><b>first_transaction_out</b>: Block index of the first transaction sent by the node</li><li><b>last_transaction_out</b>: Block index of the last transaction sent by the node</li></ul><p dir="ltr">Nodes can represent one or more Bitcoin addresses (pseudonyms used by Bitcoin users). A real entity often uses multiple addresses. The dataset contains only transactions between nodes (outer transactions), but provides information about inner transactions (transactions between addresses controlled by the same node).</p><ul><li><b>cluster_size</b>: Number of addresses represented by the node</li><li><b>cluster_num_edges</b>: Number of transactions between the addresses represented by the node</li><li><b>cluster_num_cc</b>: Number of connected components in the transaction graph of the addresses represented by the node</li><li><b>cluster_num_nodes_in_cc</b>: Number of non-isolated addresses in the cluster</li></ul><h5>Columns in the <code>transaction_edges</code> table:</h5><ul><li><b>a</b>: Node alias of the sender</li><li><b>b</b>: Node alias of the recipient</li><li><b>reveal</b>: Block index of the first transaction from <code>a</code> to <code>b</code></li><li><b>last_seen</b>: Block index of the last transaction from <code>a</code> to <code>b</code></li><li><b>total</b>: Total number of transactions from <code>a</code> to <code>b</code></li><li><b>min_sent</b>: Minimum amount sent (in satoshis) in a transaction from <code>a</code> to <code>b</code></li><li><b>max_sent</b>: Maximum amount sent (in satoshis) in a transaction from <code>a</code> to <code>b</code></li><li><b>total_sent</b>: Total amount sent (in satoshis) in all transactions from <code>a</code> to <code>b</code></li></ul><h2>Dataset of Bitcoin Labeled Addresses (addresses.csv)</h2><p dir="ltr">This file contains 103,812 labeled Bitcoin addresses with the following columns:</p><ul><li><b>address</b>: Bitcoin address</li><li><b>entity</b>: Name of the entity</li><li><b>category</b>: Type of the entity (e.g., individual, bet, ransomware, gambling, exchange, mining, ponzi, marketplace, faucet, bridge, mixer)</li><li><b>source</b>: Source used to label the address</li></ul><p><br></p> |
|---|