且构网

分享程序员开发的那些事...
且构网 - 分享程序员编程开发的那些事

在Javascript使用PHP运行后获取URL的内容(文本)

更新时间:2023-02-23 18:59:58

更新2 添加更多关于如何使用PHP中的 phantomjs

更新1 (在说明目标页面上的JavaScript需要先运行后)



方法1:使用 phantomjs (将执行javascript);



1。下载 phantomjs ,并将可执行文件放在PHP二进制文件可以访问的路径中。



2。将以下2个文件放在同一目录中:



get-website.php

 <?php 

$ phantom_script = dirname(__ FILE__)。 '/get-website.js';


$ response = exec('phantomjs'。$ phantom_script);

echo htmlspecialchars($ response);
?>

get-website.js

  var webPage = require('webpage'); 
var page = webPage.create();

page.open('http://google.com/',function(status){
console.log(page.content);
phantom.exit() ;
});

3。浏览至 get-website。 php ,目标网站, http://google.com 内容将在执行内联JavaScript后返回。您也可以使用 php /path/to/get-website.php 从命令行调用此方法。



方法2:使用Ajax与PHP(无phantomjs,因此不会运行javascript);



/get-website.php

 <?php 

$ html = file_get_contents('http://google.com');
echo $ html;
?>

test.html

 <!doctype html> 
< html lang =en>
< head>
< meta charset =utf-8>
< title> on demo< / title>
< style>
p {
color:red;
}
span {
color:blue;
}
< / style>
< script src =https://code.jquery.com/jquery-1.10.2.js>< / script>
< / head>
< body>
< button id ='click_me'>点击我< / button>
< span style =display:none;>< / span>
< script>

$(#click_me).click(function(){
$ .get(/ get-website.php,function(data){
var json = {
html:JSON.stringify(data),
delay:1
};
alert(json.html);
});
});
< / script>
< / body>
< / html>


Is it possible to get the content of an URL with PHP (using some sort of function like file_get_contents or header) but only after the execution of some Javascript code?

Example:

mysite.com has a script that does loadUrlAfterJavascriptExec('http://exampletogetcontent.com/') and prints/echoes the content. imagine that some jQuery runs on http://exampletogetcontent.com/ that changes DOM, and loadUrlAfterJavascriptExec will get the resulting HTML

Can we do that?

@Edit

Not sure if I made myself clear, but what I wanted was to get the content of a page, through an URL, but only after Javascript run on the target page (the one PHP is getting its content)

I am aware PHP runs before the page is sent to the client, and JS only after that, but mabe there was an expert workaround

Update 2 Adds more details on how to use phantomjs from PHP.

Update 1 (after clarification that javascript on target page need to run first)

Method 1:Use phantomjs(will execute javascript);

1. Download phantomjs and place the executable in a path that your PHP binary can reach.

2. Place the following 2 files in the same directory:

get-website.php

<?php

    $phantom_script= dirname(__FILE__). '/get-website.js'; 


    $response =  exec ('phantomjs ' . $phantom_script);

    echo  htmlspecialchars($response);
    ?>

get-website.js

var webPage = require('webpage');
var page = webPage.create();

page.open('http://google.com/', function(status) {
 console.log(page.content);
  phantom.exit();
});

3. Browse to get-website.php and the target site, http://google.com contents will return after executing inline javascript. You can also call this from a command line using php /path/to/get-website.php.

Method 2:Use Ajax with PHP (No phantomjs so won't run javascript);

/get-website.php

<?php

    $html=file_get_contents('http://google.com');
    echo $html;
    ?>

test.html

<!doctype html>
<html lang="en">
<head>
<meta charset="utf-8">
<title>on demo</title>
<style>
p {
color: red;
}
span {
color: blue;
}
</style>
<script src="https://code.jquery.com/jquery-1.10.2.js"></script>
</head>
<body>
<button id='click_me'>Click me</button>
<span style="display:none;"></span>
<script>

$( "#click_me" ).click(function () {
    $.get("/get-website.php", function(data) {
        var json = {
            html: JSON.stringify(data),
            delay: 1
        };
        alert(json.html);
        });
});
</script>
</body>
</html>