且构网

分享程序员开发的那些事...
且构网 - 分享程序员编程开发的那些事

计算torrent文件的信息哈希

更新时间:2023-12-03 21:18:28

确保您没有换行符该文件,您可能还需要确保它以一个'e'结尾。



torrent文件的信息哈希是信息的SHA-1哈希分区(以编码形式).torrent文件。本质上,你需要解码文件(它是bencoded),并记住字节偏移量与info键相关联的值的内容开始和结束。



例如,如果这是torrent文件:

  d4:infod6:pieces20:.................... 4:name4:test12:piece lengthi1024ee8:announce27:http://  
  d6:pieces20:.................... 4:name4:test12:piece lengthi1024ee 

有关bencoding的详细信息,请参阅 BEP3


I'm using C++ to parse the info hash of a torrent file, and I am having trouble getting a "correct" hash value in comparison to this site:

http://i-tools.org/torrent

I have constructed a very simple toy example just to make sure I have the basics right.

I opened a .torrent file in sublime and stripped off everything except for the info dictionary, so I have a file that looks like this:

d6:lengthi729067520e4:name31:ubuntu-12.04.1-desktop-i386.iso12:piece lengthi524288e6:pieces27820:¡´E¶ˆØËš3í   ..............(more unreadable stuff.....)..........

I read this file in and parse it with this code:

#include <string>
#include <sstream>
#include <iomanip>
#include <fstream>
#include <iostream>

#include <openssl/sha.h>


void printHexRep(const unsigned char * test_sha) {

    std::cout << "CALLED HEX REP...PREPPING TO PRINT!\n";
    std::ostringstream os;
    os.fill('0');
    os << std::hex;
    for (const unsigned char * ptr = test_sha; ptr < test_sha + 20; ptr++) {

        os << std::setw(2) << (unsigned int) *ptr;
    }
    std::cout << os.str() << std::endl << std::endl;
}


int main() {

    using namespace std;

    ifstream myFile ("INFO_HASH__ubuntu-12.04.1-desktop-i386.torrent", ifstream::binary);

    //Get file length
    myFile.seekg(0, myFile.end);
    int fileLength = myFile.tellg();
    myFile.seekg(0, myFile.beg);

    char buffer[fileLength];

    myFile.read(buffer, fileLength);
    cout << "File length == " << fileLength << endl;
    cout << buffer << endl << endl;

    unsigned char datSha[20];
    SHA1((unsigned char *) buffer, fileLength, datSha);
    printHexRep(datSha);

    myFile.close();

    return 0;
}

Compile it like so:

g++ -o hashes info_hasher.cpp -lssl -lcrypto

And I am met with this output:

4d0ca7e1599fbb658d886bddf3436e6543f58a8b

When I am expecting this output:

14FFE5DD23188FD5CB53A1D47F1289DB70ABF31E

Does anybody know what I might be doing wrong here? Could the problem lie with the un-readability of the end of the file? Do I need to parse this as hex first or something?

Make sure you don't have a newline at the end of the file, you may also want to make sure it ends with an 'e'.

The info-hash of a torrent file is the SHA-1 hash of the info-section (in bencoded form) from the .torrent file. Essentially you need to decode the file (it's bencoded) and remember the byte offsets where the content of the value associated with the "info" key begins and end. That's the range of bytes you need to hash.

For example, if this is the torrent file:

d4:infod6:pieces20:....................4:name4:test12:piece lengthi1024ee8:announce27:http://tracker.com/announcee

You wan to just hash this section:

d6:pieces20:....................4:name4:test12:piece lengthi1024ee

For more information on bencoding, see BEP3.