Updated: 2023-02-23 18:59:58
Is it possible to get the content of a URL with PHP (using a function like file_get_contents or header), but only after some JavaScript code on that page has executed?
Example:
mysite.com has a script that calls loadUrlAfterJavascriptExec('http://exampletogetcontent.com/') and prints/echoes the content. Imagine that some jQuery runs on http://exampletogetcontent.com/ and changes the DOM; loadUrlAfterJavascriptExec would then return the resulting HTML.
Can we do that?
@Edit
Not sure if I made myself clear: what I want is to get the content of a page through a URL, but only after JavaScript has run on the target page (the one whose content PHP is fetching).
I am aware that PHP runs before the page is sent to the client and that JS runs only after that, but maybe there is an expert workaround.
Update 2: Adds more details on how to use phantomjs from PHP.
Update 1: (after clarification that JavaScript on the target page needs to run first)
1. Download phantomjs and place the executable in a path that your PHP binary can reach.
2. Place the following 2 files in the same directory:
get-website.php
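<?php
// Absolute path to the PhantomJS script that sits next to this file.
$phantom_script = dirname(__FILE__) . '/get-website.js';
// shell_exec() returns the complete output; exec() returns only its last line.
$response = shell_exec('phantomjs ' . escapeshellarg($phantom_script));
echo htmlspecialchars((string) $response);
?>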
get-website.js
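var webPage = require('webpage');
var page = webPage.create();
page.open('http://google.com/', function (status) {
  // Print the DOM only if the page actually loaded.
  if (status === 'success') {
    // page.content is the HTML after the page's scripts have run.
    console.log(page.content);
  }
  phantom.exit();
});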
3. Browse to get-website.php and the contents of the target site, http://google.com, will be returned after its inline JavaScript has executed. You can also call this from the command line using php /path/to/get-website.php.
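The script in step 2 hard-codes the URL and prints the page as soon as it has loaded. Below is a minimal sketch of a more reusable variant, assuming the URL is passed as the first command-line argument and that a fixed delay is long enough for the page's asynchronous scripts to finish. The file name get-any-website.js, the helper pickUrl, and the 1000 ms delay are illustrative assumptions, not part of the original answer.

```javascript
// get-any-website.js (hypothetical name): print a page's HTML after its scripts ran.
// Assumed usage: phantomjs get-any-website.js http://example.com/

// Pure helper: return the URL argument, or null if none was given.
// args[0] is the script name, args[1] the first user-supplied argument.
function pickUrl(args) {
  return args.length > 1 ? args[1] : null;
}

// The block below runs only inside PhantomJS, where the global `phantom` exists.
if (typeof phantom !== 'undefined') {
  var system = require('system');
  var page = require('webpage').create();
  var url = pickUrl(system.args);

  if (url === null) {
    console.log('Usage: phantomjs get-any-website.js <url>');
    phantom.exit(1);
  } else {
    page.open(url, function (status) {
      if (status !== 'success') {
        console.log('Failed to load ' + url);
        phantom.exit(1);
        return;
      }
      // Assumed delay: give asynchronous scripts time to update the DOM.
      setTimeout(function () {
        console.log(page.content);
        phantom.exit();
      }, 1000);
    });
  }
}
```

From PHP, the URL could then be appended to the phantomjs command line, passing it through escapeshellarg() so that a user-supplied URL cannot inject shell commands. A fixed delay is the simplest choice; pages that load data slowly may need a longer wait.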
/get-website.php
<?php
$html=file_get_contents('http://google.com');
echo $html;
?>
test.html
<!doctype html>
<html lang="en">
<head>
<meta charset="utf-8">
<title>on demo</title>
<style>
p {
color: red;
}
span {
color: blue;
}
</style>
<script src="https://code.jquery.com/jquery-1.10.2.js"></script>
</head>
<body>
<button id='click_me'>Click me</button>
<span style="display:none;"></span>
<script>
$( "#click_me" ).click(function () {
$.get("/get-website.php", function(data) {
var json = {
html: JSON.stringify(data),
delay: 1
};
alert(json.html);
});
});
</script>
</body>
</html>